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Abstract. We introduce a class of variational principles on measure spaces which 
are causal in the sense that they generate a relation on pairs of points, giving rise to a 
distinction between spacelike and timelike separation. General existence results are 
proved. It is shown in examples that minimizers need not be unique. Counter exam- 
ples to compactness are discussed. The existence results are applied to variational 
principles formulated in indefinite inner product spaces. 
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Causal variational principles on measure spaces arise in the context of relativistic 
quantum theory. But they are also interesting from a purely mathematical perspective 
as a class of nonlinear variational principles whose minimizers have a surprisingly rich 
and so far largely unexplored structure. The goal of the present article is to give a 
mathematical introduction to these variational principles and to develop the existence 
theory ( Chapters [T] and [2]). The physical applications will be obtained by reformulating 
the variational principles in indefinite inner product spaces (Chapters [3] and H]). Our 
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results will be illustrated by a number of examples and counter examples, which also 
show that the minimizers are in general non-trivial and not unique. 

1. The Causal Variational Principle with Two Prescribed Eigenvalues 

1.1. Introduction and Basic Definitions. In order to give an easily accessible in- 
troduction to the basic ideas and methods, we begin with the simplest interesting 
example, the so-called causal variational principle with two prescribed eigenvalues 
(the general systems will be introduced in Chapter [21). Let (M, //) be a measure space 
of total volume /^(M) = 1. For a given parameter (3 £ [0, 1) and an integer / > 2, 
we let 3^ C Mat(C'^) be the set of all Hermitian / x /-matrices of rank at most two, 
whose non-trivial eigenvalues are equal to 1 and —f3. Consider the set of matrix-valued 
functions 

M = {F : M ^ 3" measurable} . 
For any F G M and x,y £ M, the matrix product 

A,y = Fix) ■ F{y) (1.1) 
is of rank at most two. Thus counting with algebraic multiplicities, its eigenvalues are 

Xl\Xl\0^_^ with A^^gC 

/ — 2 times 

(note that the matrix A^y is Hermitian only in the special case that F{x) and F{y) 
commute, and thus the A^^ will in general be complex). Clearly, the functions are 
measurable in x and y. Thus introducing 



the Lagrangian ^[^xy] = 


5 




the action S[F] = 


.// 






J JMxM 



we obtain a non-negative functional S on M. Our variational principle is to 

minimize 5 on M . 
We are interested in the following questions: 

• What is the infimum of the action? Is the infimum attained? 

• Provided a minimizer exists, what is its regularity? Is the minimizer unique? 
What is the structure of the minimizers? 

Before addressing these questions, we explain the form of the Lagrangian and discuss 
a few properties of our variational principle. We first point out that, since we prescribed 
its eigenvalues, every matrix in 9" has sup-norm one, and thus 

|A^^|<||F(x)F(y)||<||F(x)||||F(y)||=l. 

Hence the Lagrangian is bounded, C £ L°°[M x M, M), and the action is finite. Next, 
the transformations 

^xy ^ ^xy ^ iv(_p(2;)F(y)) = Tr(F(y)F(x)) = Tr(F(x)F(y)) = A^ + a5 

show that the X^ and A are either both real, or else they form a complex conjugate 
pair. This distinction gives rise to a notion of causality. 

Definition 1.1. Two points x,y £ M are called timefike and spacefike separated if 
the roots X^ of the characteristic polynomial of A^y are real or non-real, respectively. 



CAUSAL VARIATIONAL PRINCIPLES ON MEASURE SPACES 



3 



Let us verify that this notion is symmetric in x and y. The first method is to note 
that the are uniquely determined as the solutions of the two equations 



Substituting and cyclically commuting the arguments of the traces, one sees 

that the traces are invariant under exchanging x and y. In other words, the matri- 
ces Axy and Ayx have the same spectrum, showing that our notion of causality is 
indeed symmetric in x and y. Alternatively, this can be seen from the matrix iden- 
tity det{BC - Al) = det(CB - Al) (see for example [6l Section 3]). 

If x and y are spacelike separated, we just saw that the \^ form a complex conjugate 
pair. Hence the absolute values of \^ and A^^ coincide, and thus the Lagrangian C 
in (jl.2p vanishes. In other words, pairs {x, y) with spacelike separation drop out of the 
Lagrangian and thus do not enter the action. We refer to this fact that our variational 
principle is causal. This causality can be seen in analogy to relativity, where space-time 
points with spacelike separation cannot influence each other via the physical equations. 
This analogy will become clearer in Chapter [3l when variational principles in indefinite 
inner product spaces are considered. Until then, we shall focus on the mathematical 
properties of our variational principle. 

Qualitatively, since the Lagrangian vanishes for spacelike separation, our variational 
principle tries to achieve that as many pairs of points as possible have spacelike sep- 
aration. On the other hand, in the special case F{x) = F{y), the matrix Axy has 
the non-zero eigenvalues 1 and showing that x and y will have timelike separa- 
tion if F(x) and F{y) are sufficiently close to each other. Thus there are competing 
mechanisms, and this will lead to mathematically interesting effects. 

Another point of mathematical interest is that our variational principle generates 
mathematical structures on M. Note that (M, /x) is merely a measure space, but 
we do not assume a topology. But a given minimizer F induces a topology on M 
(namely F~^((D), where denotes the set of all open subsets of 9"), and furthermore F 
induces the causal structure of Definition 11.11 Thus a minimizer generates on M a 
topological and causal structure. In order to better understand this structure formation, 
one needs to clarify the freedom in choosing the minimizers of the variational principle. 
In particular, if our variational principle allowed to distinguish a specific minimizer F 
(determined modulo isomorphisms of the measure space (M, fi) ) , this would give rise 
to a canonical topology and a canonical causal structure on M. 

1.2. Existence of Minimizers. This section is devoted to the proof of the following 
general existence theorem. 

Theorem 1.2. There is a function F G M such that 



The most obvious idea for the proof is to try the direct method of the calculus of 
variations. Thus let G M be a minimizing sequence, i.e. 



The proof would be completed if we found a convergent subsequence Fn^, and could 
prove that S was lower semi-continuous. The following consideration explains why 
this method does not seem to work. If our subsequence converged in the weak sense, 
Fn,, —r F, the spectral properties of the matrices F^^, (x) could not be controlled in 



A^^ + X^_y = Tt{Axy) and (A^^)^ + {X^Jf = Tr {A^) . 



S[F] =miS . 
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the limit. Thus the matrix F(x) would in general no longer have the eigenvalues 1 
and — /3, and thus F would not be the desired minimizer. This explains why for a 
useful notion of convergence in J" it seems necessary to consider the topology induced 
by the sup-distance function 

d{F,G) = snp\\F{x)-G{x)\\, (1.3) 

where || . || is a matrix norm on 3". Now suppose that (p is an isomorphism of the measure 
space {M,ijl) (i.e. a measure preserving bijection of M). Then for any given Fq G M, 
the function Fq o (j) \s again in M, and both functions have the same action. More 
generally, S is constant on the orbit U of Fq under the action of such isomorphisms, 

U := {Fq o I (/) isomorphism of (M, ^u)} C M . (1.4) 

The problem is that the orbits (II. 4p are in general not compact in the topology (II. 3p . 
To see this in a simple example, we take M = [0, 1) with the Lebesgue measure 
and choose -Fq £ M as a function which takes two different values p,q G J', being 
constant on the intervals [0, |) and [|,1). We consider the one-parameter family of 
isomorphisms (j)x{x) = {x + X) mod 1 with A G [0, 1). Then the functions F\{x) : = 
FQO(j)\ are all in U, but for any A 7^ /x their distance is a non-zero constant, d{F\,F^) = 
d{p, q) > 0. Hence there is even an uncountable family of functions in U which has no 
convergent subsequence. 

Our method to avoid the above problem is to translate the functions Fk into mea- 
sures pk on as we now explain. We first note that in the case < /3 < 1, every 
point p € 3' is a f X /-matrix, which is characterized by the two orthogonal eigenspaces 
corresponding to the eigenvalues 1 and — /3. Characterizing p equivalently by the first 
eigenspace and the linear span of both eigenspaces, we can identify p with a point of 
the flag manifold J'^'^(C'^) (for the detailed definition we refer to [111 Chapter I, §3.1]). 
Likewise, in the case /3 = 0, every point p G J" is characterized by the eigenspace 
corresponding to the eigenvalue one, and thus 3" can be identified with the Grass- 
mannian 3^^{Cf). In each case, this identification is useful because it makes 3^ into 
a smooth compact manifold. Moreover, 3" is a homogeneous space, meaning that the 
mapping 

p UpU-^ with U G U(/) (1.5) 

defines a transitive action of the group U(/) on 3". We introduce on 3" a Riemannian 
metric g which is invariant under this group action and denote the corresponding 
invariant measure by ^j-. For simplicity, we normalize g such that /i3^(3^) = 1- Taking 
the infimum of the lengths of curves gives on 3" a distance function 

d:3-x3-^R+. 

The topology of 3" is generated by the open balls B^{y) of distance radius e centered 
at 2/ G 3". 

Next to any F G M we introduce a measure /o on 3^ by defining that C 3" is 
measurable if and only if F~^{Q.) C M is measurable and by setting 

p{Q) = i,{F-\n)) . (1.6) 

Clearly, /9(3") = f^{M) = 1. The advantage of working with p is that it does not depend 
on isomorphisms of (M, p) , as the simple calculation 

pi{F o 0)-i(J])) = ^,{r\F-\n))) = M(F-i(J7)) 
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shows. Furthermore, our action can be expressed in terms of the measure p by 

S{p)= [[ C[p-q]dpip)dpiq), (1.7) 

where p ■ q is the matrix multiphcation of elements of 9". Our strategy is to first 
construct a minimizer of (jl.7p and then to construct the corresponding minimizer F 
of the original variational problem. 

Let C°(J) be the Banach space of continuous functions on 9", equipped with the sup- 
norm. We consider every measure pk according to Pk{f) = Jjfdpk (with / G C'^(3")) 
linear functional on C^{'J). The relations 

\Pk{f)\ < 11/11 = 11/11 , Pk{i^) = = 1 (1.8) 

(where : 9" — > R is the constant function one) yield that the pk are continuous 
functionals and ||/Ofc||cO(?)* = 1- The positivity of the measures pk is expressed by 

Pkif) > for ah / G C°(:T) with / > . (1.9) 

The Banach-Alaoglu theorem [14J yields a subsequence, for simplicity again denoted 
by Pk, which converges in the weak-*-topology; that is, for every / G C^{3^) the series 
Pkif) converges. Thus by p{f) = linik Pk{f) we can define a functional on (7*^(9"). By 
the Riesz representation theorem |15j . there is a regular Borel measure p on 9' such 
that 

Pif)= [ fdp for ah / G CO(g-) . 

From (jl.Sp one sees that p is normalized to p(9") = 1. Furthermore, taking the 
limit n — > 00 in (jl.9p one sees that p is a positive measure. We conclude that there is 
a subsequence pk and a positive normalized regular Borel measure such that 

/ fdpk^ [ fdp for all / G C7°(3-) . 

Since the function JZ\p ■ q] with C according to (|1.2p is continuous in both argu- 
ments p, g G 3", we conclude that S{pk) — > <S{p). We have thus proved the following 
result. 

Lemma 1.3. There is a positive normalized regular Borel measure p on 3^ such that 

[[ C[p-q]dpip)dpiq)=iniS[F]. 

The remaining step is to "realize" the measure /o by a function F G M as follows. 

Lemma 1.4. There is a measurable function F G M such that the measure p of 
Lemma \1.3\ has the representation 

p(0) = p{F^^{0,)) for every Borel set C J . 

Proof. We recall that a measurable set C M is called an atom if p{0,) > and if every 
subset K C 0, with p{K) < /x(0) has measure zero (cf. [lOl Section 40]). A measure 
is said to be atomic if every set of non-zero measure contains an atom. Conversely, 
a measure is non-atomic if it contains no atoms. The measure p can be decomposed 
into the sum of an atomic measure {p'^,M'^) and a non- atomic measure {p^,M^) in 
the sense that 

M = U M" and p = p"^ + p" (1.10) 
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(see |12) for a proof in general measure spaces). Furthermore, an exhaustion argument 
(see Section 42 (2)]) shows that 

For all Q G [0, //''(M'^)] there is a measurable set Q C M"" with /i'^(r2) = a . (1.11) 

In order to treat the atomic part, we denote the atoms of the measure {iJ,'^,M'^) 
by 21. Since every element ^ G 21 has measure p'^{A) > and 

/(A) = ^,{M'') < 1 , 

the set 2t is at most countable. Furthermore, by modifying the functions Ff. on sets 
of measure zero we can arrange that for every ^4 G 21, the set -Ffc(j4) consist of just 
one point, for simplicity again denoted by i<fc(A). For any j4 G 21 we consider the 
series Fk{A) G 9". Since 3' is compact, this series has an accumulation point. Using 
that 21 is at most countable, a diagonal series argument yields that, possibly after 
choosing again a subsequence of {Fk)nmi for every ^ G 21 the series Fk{A) converges 
in 9" as A; — > oo. Setting f{A) = limk^oo Fk{A), it follows that for every e > 
and y G 9", 

p{BM) = lim Pk{Be{y)) > /^'(^) • 

Ag2l with f{A)=y 

Since p is a regular Borel measure, we can let e — > to obtain the same bound 
for p(\yY)- Hence subtracting a sum of Dirac measures, 

/>^ = />-^/(^)5;(^), (1.12) 

the resulting measure p'^ on 9" is again positive. Defining the function F on by 

Fl^^d : M^^J : A^ f{A) , 

it remains to construct a function g := F\m'' ■ M'^ 3' such that 

p^(0) = p%g-^{n)) for every Borel set C 3" . (1.13) 

To handle the non-atomic measure we proceed by induction in A:. In the first 
step A; = 1, we decompose 9" into a disjoint finite union of non-empty Borel sets of 
bounded distance diameter, 

3" = g'Ju • • • U3"/^^^(^) with diam{9-j) < 1. 

Since p'^ is non-atomic and 

Ae2t 

we can apply (jl.lip iteratively to decompose M'^ into a disjoint union of measurable 
sets Ml, . . . , ^-^^ with p^{Mf ) = p'^{Fl). Of every set S'j we choose a point yf 

and define a step function gi : ^ J by gi\j^.ji = yj, 

^ — li-''7^max(l)- 

For the iteration step (A; — 1) — > A; we decompose each of the sets 3"?~^, . . . , J'f -.n 
into sets of smaller diameter to obtain a decomposition of 9" of the form 

J = U • • • Ujf_(,) with diam(yf ) < i. 
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Again using (jl.lip . we subdivide the sets Mj^ ^ into smaller sets M^, . . . , Mj' with 
the property ji'^iM}') = p'=(3"f). Of every set 9"^ we again choose a point and define 
a step function gk : M"^ ^ 3" by gk\M!^ = , I = ■ ■ ■ , ^max(^)- 

This inductive procedure gives a series of step functions {gk)ke'N- By construction, 
this series converges uniformly to a measurable function g : M*^ 3". Furthermore, 
for any Borel set C 9", 

fi%g~'m Yl /^'(^') = E p'^^'^ ^ ^'(^) • 

yfen yfen 

Thus g satisfies (|1.13p . completing the proof. □ 

This completes the proof of Theorem 11.21 

1.3. Non-Uniqueness in the Discrete Setting. We now discuss the uniqueness 
problem in the case when the measure n is discrete. We begin with the simple example 
where M = {1} consists of only one point. In this case, every function F G M can be 
identified with a point F{1) G 9". All these functions give the same value S = ^(1-/3^)^ 
of the action, and thus the minimizer is clearly not unique. However, since according 
to (jl.Sp the U(/)-symmetry group acts transitively on 9", all the minimizers can be 
obtained from each other by a suitable U(/)-transformation. Thus the minimizer is 
unique up to U(/)-transformations on 3". 

The next example gives a connection to a problem already studied in the literature 
and gives a good intuition for the mechanisms in our variational principle. 

Example 1.5. (Maximizing distances of points on S"^) We choose / = 2, /? = and 
let M = {1, . . . ,m} be a finite set with the normalized counting measure /i({l}) = 
• • • = /i({m}) = — . Then every F{x) is a Hermitian 2 x 2-matrix with eigenvalues 1 
and 0. Representing F[x) as a linear combination of the identity matrix and the three 
Pauli matrices a defined by 

1 /O 1\ 2 /^O -A 3 /I 0\ 

^ = Vi oj' ^ = Vi oj' ^ =1^0 -ij' 

we obtain 

F{x) = - (1 + Vx^) , where \vx\ = 1 ■ (1-14) 

In this way, every F{x) can be described by a unit vector Vx G S"^. The Lagrangian is 
computed to be 

C[Axy] = \{l + Vx-Vyf . (1.15) 

o 

In particular, all points have timelike separation. Furthermore, the Lagrangian be- 
comes smaller if the angle between Vx and Vy is larger. Thus qualitatively speaking, our 
variational principle attempts to maximize the angles between the vectors {vx)x=i,...,m- 
In other words, the variational principle tries to distribute m points on the sphere, 
maximizing their distances according to the "repulsive pair potential" (|1.15p . This 
problem has been studied for a variety of potentials; see [16] for a review. 

Again, the minimizer cannot be unique, because the action of U(/) will give rise 
to different minimizers. Furthermore, other minimizers are obtained by permuting 
the points of M. Even if we consider the problem modulo the action of U(/) on J" 
and permutations in M, in general the minimizers will still not be unique. Namely, 
as discussed in [16], the discreteness of the problem will in general give rise to a 
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complicated geometric structure on the sphere, leading to many minima of the action. 



The following example illustrates that the causal structure can be a further source 
of non- uniqueness. 

Example 1.6. (Non-uniqueness for point distributions on S"^) We choose / = 2, 
(3 G (0, 1) and let M = {1, 2} consist of two points, again with the normalized counting 
measure. Representing the matrices F{x) similar to (jl.l4p by a linear combination of 
Pauli matrices, 

F{x) ='^-^l+'^-^Vxa with = 1 , (1.16) 

a short calculation shows that the points 1 and 2 are spacelike separated if and only if 

+ 6/3-1 

In this case, only the diagonal terms A^^ contribute to the action, and thus 

S = \{C[A,r]+C[A22]) = \{l-P^f . (1.17) 

Noting that the values of £[ylxx] are already determined by the given parameter (3 but 
are independent of the choice of the matrices F{x) G 9", we see that ()1.17p is even the 
infimum of the action. We conclude that there is a continuous family of minimizers. 
A short calculation shows that the eigenvalues of the matrix A12 depend on the angle 
between vi and vi, and thus the minimizers for different values of this angle cannot 
be U(/)-equivalent. 

We conclude that if the measure /i is the counting measure, we cannot hope for 
uniqueness, because the discrete nature of the problem will give rise to a complicated 
geometric structure with many minima of the action. The situation will be similarly 
involved if ^ consist of both discrete and a continuous parts. 

However, the above examples give us hope that the uniqueness problem might sim- 
plify if the measure is purely continuous. Namely, in this case the variational principle 
with Lagrangian (jl.lSp should have a unique minimizer, obtained by distributing the 
continuous measure uniformly on the sphere. One might conjecture that the same 
measure should also be the minimizer in the case /? > 0. Also, can one expect a 
unique minimizer in the case / > 2? Motivated by these specific questions, we now 
turn attention to a more systematic study of the uniqueness problem in the continuous 
setting. 

1.4. The Continuum Variational Principle and its Hilbert Space Formula- 
tion. In the remainder of this chapter we shall assume that is a non-atomic measure 
on M (see [10^ Section 40] or the proof Lemma [l.4p . In this case, we can conveniently 
reformulate our variational problem purely in terms of positive normalized regular 
Borel measures on 9", denoted in what follows by B{y). Namely, suppose that we 



minimize S{p) = jj C[p • q] dp{p) dp{q) onB{3^). (1-18) 

Then, since every F G M gives rise to a corresponding measure p on 9" (see (II. 6p ). it is 
obvious that infg^gjyj > infpgg(gr) 5(/9). Conversely, the inductive construction in 
the proof of Lemma [1.4l shows that every p £ B{F) can be realized by a function F G M, 
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and that this F is unique up to isomorphisms of the measure space (M, ^u). Hence 
in what follows it suffices to consider the variational principle (jl.lSp . referred to as 
the continuum variational principle. The existence of minimizers is an immediate 
consequence of Lemma 11.31 

Corollary 1.7. The infimum of the continuum variational principle (jl.lSp is attained. 

In order to get into the position to apply spectral methods, we let !K = L^(3", dfi^) 
be the Hilbert space with scalar product 



If we assume that the measure p in (jl.lSp is so regular and bounded that it has a 
Radon-Nikodym decomposition (see for example [lOl Section 31]) 

dp = ijdn^ with ipeJi, (1.19) 

then the action can be expressed as an expectation value 

where L is the integral operator defined by 

(LV')(p) = JcIp- q] ^{q) dfi^iq) . (1.20) 

The conditions that p be positive and normalized can be expressed by demanding 
that ip > and that = 1 (Ig^ again denotes the constant function one). Hence 

the variational principle (11.181) can be reformulated in Hilbert spaces by 



minimize S^ip) = {ip^Lifj) for ip £ "K with > and (ipjlj-) = 1 . (1-21) 



We remark that the constraint > is unusual in the Hilbert space setting. In 
particular, (jl.2ip is much different from minimizing a Rayleigh quotient. Furthermore, 
we point out that the representation p.l9p poses a strong condition on the Borel 
measure p. However, the measures satisfying this condition are dense in B{3^) in the 
C'^(9")*-topology. Therefore, the infima of p.l8p and (|1.2ip coincide. But it is not clear 
whether the variational principle (ll.2ip has a minimizer. In cases where the answer 
is yes, this means that there are minimizers p of (|1.18p satisfying (|1.19p . If every 
minimizing sequence of ()1.2ip had a convergent subsequence, we could even conclude 
that every minimizer p of (I1.18P satisfies (I1.19p . 

1.5. Non-Uniqueness and Non- Triviality of Minimizers. In this section we shall 
use spectral methods to prove the following result. 

Theorem 1.8. For the continuum variational principle in the Hilbert space formula- 
tion (jl.2ip the following holds: 

(i) In the case P = 0, the constant function 1^ is a minimizer, 

(l:^,Llj)= mi S{p). 

There is an infinite-dimensional family of minimizers. 

(ii) In the case < (3 < 1, the function Ig^ is not a minimizer, 

{l:?,Ll^) > inf S{p). 
The minimizer p G B{3') from Corollary \1.7\ is not unique. 
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Moreover, in the case (3 = and / = 2, we give an explicit example of a minimizer p G 
B(3") which does not have the representation (jl.lOp with ip £ (see Example ll.lip . 
In preparation for the proof, we compile a few spectral properties of the operator L. 

Lemma 1.9. L is a compact self-adjoint operator on !K. The eigenvectors correspond- 
ing to non-zero eigenvalues are continuous functions on 3^. The sup-norm \\L\\ is a 
non- degenerate eigenvalue, and the constant function Ij- is the corresponding eigen- 
vector. 

Proof. Clearly, the kernel C [p q] of L is a continuous function on 3" x 9". Using further- 
more that y is a compact manifold, the operator L is obviously bounded. Moreover, 
the kernel C[p ■ q] is real- valued and, as explained after Definition 11.11 it is symmetric 
in its two arguments. This implies that L is self-adjoint. The integral estimate 

Tr(L*L) = // £[pq] C[qp] dp^{p) dfi:,{q) < fi{3^f snp\£[pqf 

shows that the operator L is Hilbert-Schmidt, and thus compact. This means that its 
the spectrum it(L) C M is purely discrete, bounded and accumulates at most at zero. 
The spectral theorem gives a decomposition 

Ae(T(L) 

where the Ex are projectors onto finite-dimensional, mutually orthogonal eigenspaces. 

Suppose that ^ is an eigenvector of L corresponding to an eigenvalue A 7^ 0. Then ip 
can be written as 

Regarding the right side as a convolution of tp with a continuous kernel, one sees that ip 
is a continuous function. 

The Lagrangian (jl.2p is U(/)-invariant, meaning that C[pq] = C[UpqU^^] for ev- 
ery U € U(/). Using furthermore the U(/)-invariance of the integration measure d^s^, 
we obtain 

(Ll^)(p) = [ C[pq] diji^iq) = [ C[pU-\U] dpj{q) = {Ll^){UpU-^) . 

Since U(/) acts transitively on 9", we conclude that the function L(lg-) is constant. 
Hence Ig^ is an eigenvector. 

The idea for completing the proof is to note that the operator L has a non-negative 
kernel (cf. ()1.20p and (|1.2p ') and to apply the Perron- Frobenius theorem (see [171 Chap- 
ter 5] for matrices and [8, Section 3.3] for integral operators). Unfortunately, this the- 
orem cannot be applied in our setting, because it requires that the kernel be positive 
almost everywhere, whereas our kernel -^>[p(/] vanishes whenever the points p and q are 
spacelike separated. But we can adapt the Perron-Frobenius method as follows: First, 
the estimate 

— inf(T(L)= sup — (0, L0)< sup ( |</)|, L|(/)| ) < sup a{L) 

(/iGlK, 11011=1 11011=1 

shows that the supremum of the spectrum coincides with the sup-norm. Hence, using 
that L is compact, ||L|| > is an eigenvalue corresponding to a finite-dimensional 
eigenspace. Suppose that ^ is a corresponding normalized eigenvector. Then 

||L|| = (^,L^)<(|V'|,L|V|><||^||, 
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showing that {ipl is again an eigenvector corresponding to the eigenvalue ||L||. This 
eigenvector is not orthogonal to I5-, because 



This imphes that the two eigenvectors must be in the same eigenspace, and thus 
Llj.= \\L\\l^. 

We now proceed by contradiction. Assume that the eigenvalue ||-L|| is degenerate. 
We choose a vector ip in the corresponding eigenspace which is orthogonal to Ig^. Since 
the real and imaginary parts of an eigenfunction are again eigenvectors, we can arrange 
that is real- valued. Then the orthogonality (ipjl^r) = implies that ip changes sign. 
Since ip is continuous, its zero set must be non-empty. We choose a point po on the 
boundary of the zero set. Then ipipo) = 0; but ip is non-trivial on any neighborhood 
of pq. Since the Lagrangian is continuous and £[poPo] > 0, there is a point qq in a 
neighborhood of po where C\pQ qo] > and '(/'('Zo) / 0. Taking the linear combination 

we have constructed an eigenvector corresponding to the eigenvalue ||L|| such that 

(j){po) = 1 , (piqo) = -1 and C[po go] > . 

Using continuity, it follows that the function C[p q] (l){p) (p^q) is strictly negative on a 
set of non-zero measure. Hence 

||L|| ||</)f = L0) = // £[pq]^{p)<p{q)d^i^{p)d^i^{q) 

< [[ £[pq]\<P{p)\\^{q)\dfi:^{p)dfi:j{q) = {\<p\,L\(P\) , 

a contradiction. □ 

We can now characterize the structure of the minimizers of the continuum variational 
principle depending on spectral properties of L. We distinguish between three cases: 

(A) L > and kerL = {0}: 

We choose an orthonormal eigenvector basis (V'n)neN with Lipn = Xntpn, or- 
dered such that ipi = Ip and Ai = ||L||. Expanding the vector ip in (jl.2ip in 
this basis, the condition {ip,lj) = 1 implies that the coefficient of ipi equals 
one, 

ip = ipi + '^Cn'ipn with ^|c„p<oo. (1-22) 

n=2 n=2 

Using furthermore that all eigenvalues are strictly positive, we obtain the esti- 
mate 

oo 

{^P,L^P) = Xl + J2^n\cn\^>Xl, (1-23) 

n=2 

and equality holds only if all the coefficients C2, C3, . . . vanish. This shows that 
the constant function is the unique minimizer. 

(B) L > and the kernel of L is non-trivial: 

Again representing ip in the form (|1.22p . the inequality (jl.23p again holds, but 
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we have equality if and only if x/j — ipi S ker L. Since we must take into account 
the condition ip >0, the set of all minimizers in % is given by 

(V'l +kerL) □{'0 G JC.V' > a.e.} . (1.24) 

Thus the constant function is again a minimizer, but there are other, nontrivial 
minimizers. If the kernel of L is finite-dimensional, the functions in (jl.24p 
are all continuous, and thus we can say that all minimizing measures of the 
variational principle (jl.lSp have the representation (jl.l9p with a continuous 
function However, if L is infinite-dimensional, in general there will be 

sequences in ()1.24p which do not converge in "K but do converge in (7*^(3")* to a 
non-trivial Borel measure. Thus in this case we can expect minimizers of (jl.lSp 
which do not admit a representation (|1.19p with -0 G J{ (for an example of such 
a minimizer see Example II. lip . 
(C) L has negative eigenvalues: 

The inequality (|1.23p can be violated by choosing the coefficient c„ correspond- 
ing to one of the negative eigenvalues to be non-zero. Hence the constant 
function is no longer a minimizer. There seems no general reason why a mini- 
mizing sequence should converge in J{. Thus we cannot expect that there are 
any minimizers in J{. But according to Corollary II. 7^ the minimizers will still 
exist in the sense of measures. Since these minimizers necessarily break the 
U(/)-symmetry, the action of U(/) (see page H]) will give rise to other mini- 
mizers. Hence the minimizer p G i3(5') as obtained from Corollary 11.71 is not 
unique. 

Following these arguments, it remains to analyze the spectrum of L. We begin with 
the case (3 = 0. 

Lemma 1.10. In the case /? = 0, the operator L is non-negative. Its rank is finite 
and bounded by 

dim L (IK) < f . 

Proof. In the case /3 = 0, every p G 3" is a / x /-matrix of rank one, having the non- 
trivial eigenvalue one. We denote a normalized eigenvector of p corresponding to the 
non-trivial eigenvalue by u{p) G C-^. Since the matrix pq has rank at most one, the 
Lagrangian (jl.2p simplifies to 

£[pq] = ^\{u{p),u{q))^f\^ . (1.25) 

We now introduce the operator K : J{ ^ (0)* (0)* hy 

= J V'(p) u{p) (g) u{p) u{p)* u{p)* dfi3^{p) 

(where we use the natural embedding of to its dual space given by complex conju- 
gation). Then 



\\K{'ip)\\ = // tpip)^{q) {u{p),u{q))^f {u{p),u{q))^f dn:^{p) dn:^{q) 
Mo/ / r / \ 

In other words, 2L = K*K, showing that L is non-negative and that its rank is at 
most the dimension of the vector space C-^ C-^ (C-^)* (C-^)*. □ 
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We next give a more explicit analysis of the case / = 2 and /3 = 0. In this case, the 
Pauli representation (jl.l4p allows us to identify 9" with S'^. Then L is a spherically 
symmetric operator on L^(S'^). Thus it has the same eigenspaces as the spherical 
Laplacian, 

oo 
1=0 

where the Ei are the projection operators onto the eigenspaces of the spherical Lapla- 
cian corresponding to the eigenvalue /(/ + 1). The kernel of the Ei can be given in 
terms of the spherical harmonics YJ™" by 

I 

Ei{p, q)=ATT Yl^{p) yf\qj, where p,qeS^ 

m=—l 

(the factor 47r arises because our integration measure on 5^ has total volume 
one). The eigenvalues A; are most easily computed by applying L to the spherical 
harmonic Yj^, 

XlYl^ip) = {LY^rn = I C[pq] Y^{q) dfi^{q) . (1.26) 



Choosing on standard polar coordinates {'d,ip), the spherical harmonics Y^^ are 
multiples of the Legendre polynomials Pi{cos'd). More precisely, using the standard 
normalization conventions 



yA^,<^) = P/(cos^) and Pi{l) = 1 , 

we can evaluate (I1.26P for p at the north pole ?? = to obtain the simple formula 

1 

Xi = - £(cosi?)Pi(cosi?)(icost?. (1.27) 
2 J-i 

Setting /? = 0, the Lagrangian simplifies to (jl.lSp . 

£(cosi?) = - (1 + cosi?)2 (if / = 2, /? = 0) . 
8 

In this case, the integral (jl.27p can easily be calculated 
111 

^0 — , ^1 — 77^; ^2 — — , A3 — A4 — . . . — . 

6 12 bU 

Counting the multiplicities 2Z + 1 of the eigenspaces, the rank of L is computed to 

be 1 + 3 + 5 = 9, in agreement with the upper bound = 16 from Lemma ll.lOl 

This detailed information allows us to give a minimizer which is not in J{. 

Example 1.11. (A distributional minimizer) In the case / = 2 and /3 = 0, we again 
use the identification 9" ~ S*^ and choose polar coordinates {^^ip). For any parame- 
ter a G [0, 1], we define the measure p G B{'J) by 



9 dp -- 
■J J J 



a+^y^e(|cos7?| 



■J 



27r ^ ^ 

dip , 



5,( COS 1? = COST? = 



where is the Heaviside function. This is a positive normalized regular Borel measure, 
but due to the singular contributions at cosi? = it^, it cannot be represented in the 
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form ()1.19p with ^ ^ 'K. In order to show that p is a minimizer, we must verify that 
it coincides with the constant function on the nontrivial eigenspaces of L, i.e. 

p{Y^) = ^= and pi^{^) = for alU = 1, 2 and m = -/,..., L 
v47r 



The first condition is obvious because Yq = (47r)~2 is constant and p is normahzed. 
Since the spherical harmonics YJ™ are the restrictions to S"^ of polynomials of degree / 
in R^, a symmetry consideration in the (/^-integral shows that all the other conditions 
reduce to the three constraints 

p{z) = , p{2,z^ - 1) = , /)(3x2 - 1) = , (1.28) 

where (x, y, z) are Cartesian coordinates, where •& denotes the angle to the z-axis. The 
first equation in (jl.28p is immediate by symmetry, whereas the second equation follows 
from the computation 

1-a /"^^ o ... 3 



C{?,z'^-l)dz + - (1-a) (3z2-l) 
•^5 ^ 



.1=0 

" 2 



In order to verify the third relation, we first note that the symmetry around the z-axis 
yields p(3a;^ — 1) = p(3y^ — 1). Also using the last equation in ()1.28p . we obtain 

p{^x^ -l) = \ pU{x^ + y2 + ^2) _ 3\ 1 (3 _ 3) ^ ^ 



£(cos i9) = ^ (1 + cos ^) max { 0, cos i? + ) . (1.29) 



concluding the proof. 

If /3 > (and still / = 2), the situation is more interesting because of the non-trivial 
causal structure, which leads to a region where the Lagrangian vanishes identically. 
Namely, a short calculation using (11.160 yields 

The resulting integrand in (jl.27p is again a polynomial in cost?, and thus for every I, 
the function A/(/3) can be computed in closed form. For example, 

_ (l-/?)4(i + 4/? + /32) 

° 6(l + /3)2 

and similarly for the other eigenvalues (clearly, the formulas get more complicated 
for larger I, but expressions up to / ~ 20 are handled easily by computer algebra). 
On the left of Figured] the three lowest eigenvalues Aq, Ai and A2 of L are shown as 
functions of (3. These eigenvalues are always positive. On the right of Figure [H the 
next eigenvalues A3, A4 and A5 are plotted versus (3. The Taylor expansion 

shows that A3 is negative for small positive /3, and Figure [T] illustrates that this func- 
tion A3(/?) stays negative on the interval < (3 < 0.07. Thus for /3 in this range, we 
are in case (C) on page [12] where the minimizer is non-trivial. If P is further increased, 
A3 becomes positive, but then A4 is negative. If P is further increased, A4 becomes 
positive, but then A5 is negative. More generally, these plots suggest that for every /3 
in the range < /? < 1 at least one of the A/ should be negative. This is indeed the 
case, as the following lemma shows. 
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Figure 1. The eigenvalues A/(/3) of the operator L in the case / = 2. 

Lemma 1.12. In the case f = 2, 

min Xiip) < for all [3 £ (0, 1) . 

Proof. In order to analyze the asymptotics /3 y 1, we first expand the second fraction 
in ^M), 

showing that the Lagrangian vanishes identically except in a neighborhood of the 
north pole -d = 0. Thus we may also expand in powers of "i? to obtain the asymptotic 
Lagrangian 



1-^] maxfo 
1 ^ j max^O, 



:i.3o) 



In order to derive the asymptotic form of the Legendre polynomials P;(cos -d) near the 
pole, we note that these polynomials are given as solutions of the ODE 



—aT-a (smi?:^PKcosi9) ) = I {I + I) Pi {cos ^) 
sm n an \ an 



1 . 



Using the asymptotics sini? = i9 -\- 0{i9^), this differential equation simplifies to 

whose solution is a Bessel function of the first kind, 

Pf^y(T?) = Jo (71(7+1)^) . (1.31) 

Substituting (ll.SOp and (|1.3ip into (|1.27p and using the asymptotic form of the inte- 
gration measure dcos'd = (i? + 0(1!}^)) (I'd, the integral can be computed in terms of 
the confiuent hyper geometric function, 



Af y = - 
' 2 



C^y(^)p^y(^)^d^ 



where we set 



1 + /3(2 + /? + A)) oFi(3, -x) - (1 - oFi{2, -x] 



A = /(/ + l) and X = . A. 



(1.32) 
(1.33) 

(1.34) 
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Figure 2. The smallest eigenvalue of L in the case / = 2 using the 
exact formulas with Legendre polynomials and the asymptotics with 
Bessel functions. 



Next we need to specify / as a function of [3. Qualitatively, speaking, the Bessel 
function ()1.3ip oscillates with frequency of the order + 1), whereas the La- 
grangian (|1.3U|) decreases in ■!?, vanishing identically for 

Thus in order to make the integral ()1.32p negative, we choose / such that {'&) 
oscillates on [0, "i^max] just once, being negative at i? = "i^max- A good method would be 
to determine / from the equation 



^asy(^asy ~l~ 1) ''^max — 5.5 . 

Since this is in general not an integer, we introduce / using the Gauss bracket, 

:= 1 + [/asy] . 

With this choice of Z, a simple calculation shows that if /3 E [0.4, 1), the variable x as 
defined by (jl.34p takes values in the range x G [7, 12]. For [3 and x chosen in these 
intervals, a direct inspection shows that the function (jl.33p is always negative. 

Our analysis so far shows that the lemma holds if (3 is sufficiently close to one. 
On the other hand, on any interval /3 G (0, /3max] with /3max < Ij one can consider 
the explicit formulas obtained from the Legendre polynomials to verify that L has a 
negative eigenvalue. As shown in Figure [21 at /? « 0.7 the asymptotics with Bessel 
functions is a good approximation to the exact analysis. Because of this obvious fit, 
we omit rigorous error estimates of the asymptotic analysis. □ 

The case / > 2 is considerably more complicated. However, as shown in the next 
lemma, there is a general mechanism giving rise to negative eigenvalues. 

Lemma 1.13. If f > 2 and < /3 < 1, the operator L has negative eigenvalues. 

Proof. We choose p,q £ 3' having the matrix representation 

P = Pe,- pPe, , q = Pe,- PPe, , 
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where Pe^ denotes the orthogonal projection to the canonical basis vector Ci & . It 
suffices to show that the 2 x 2-matrix 

has a negative eigenvalue, as the following argument shows. Suppose that (/> G is 
a vector with {4>,l(p)^2 < 0. We consider a series V'n S 3^ which converges in the 
C°(I?')*-topology to the measure (piSp + (j)2Sq (where 6p denotes the Dirac measure 
supported at p). Then 

proving that L has indeed negative eigenvalues. 

To show that the matrix (jl.35p has a negative eigenvalue, we first compute the 
spectra of the matrix products, 

aipp) = a{qq) = {l,f)\0} , a{pq) = a{qp) = {1,0} . 

Hence 

^ = l{^' (1 ' 'i^* ^ = I - ^')' - 1) < ' 

and thus / has precisely one negative eigenvalue. □ 



This completes the proof of Theorem 11.81 

We now turn attention to a more general class of variational principles. The main 
additional difficulty will be that the target space is no longer compact. Intuitively, 
this leads to the possibility that part of the support of the measures pk "escapes to 
infinity." In order to rule this out, we must control the behavior of the measures pk 
by suitable a-priori estimates. 

2. A General Class of Causal Variational Principles 

2.1. Definitions and Statement of Results. Let {M,p) be a measure space of 
total volume p{M) = 1. For given integers /, n with f > 2n we let 3' be the set of all 
Hermitian / x /-matrices of rank at most 2n, having at most n positive and at most n 
negative eigenvalues. We let M be the set of matrix-valued functions 

M = {F : M ^ 3" measurable} . 

For a given i^" G M and any x,y G M, we again form the matrix product (jl.ip and 
denote its eigenvalues counted with algebraic multiplicities by 

Xl^...,XZ 0^_^ with XfeC. (2.1) 

f — 2n times 

We define the spectral weight \Axy\ by 

2n 

\A.y\ = Y.\^7l (2.2) 

i=i 

and similarly set \A'^ \ = Yl'^=i l-^f^P- We introduce 



the Lagrangian C[Axy] = \Aly \ - ^ \Axyf 



(2.3) 
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and define the functionals S and T by 

S[F]= [[ C[A,y]d,,ix)dfi{y) (2.4) 

J JMxM 

T[F]= [[ \A,y\^di,ix)dfi{y). (2.5) 

J JMxM 

We also introduce the following constraints: 
(CI) The trace constraint: 

/ Tv{F{x))di^{x) = f. 

JM 

(C2) The identity constraint: 

F{x) dn{x) = Icf . 

M 

(C3) Prescribing 2n eigenvalues: We denote the cigcnvahics of F[x) counted 
with multiplicities hy vi, . . . jVf and order them such that 

Vl< ■■■ <l'n < < f„+i < . . . < f2n (2.6) 

and 2^271+1 = ■ ■ ■ = I'f = 0- We introduce constants ci, . . . , C2n and impose that 

Vjix) = Cj for all a; G M and j e K . (2.7) 

We now state our results and explain them afterwards. In order to rule out trivial 
cases, we shall always assume that the set of functions satisfying the constraints is 
non-empty. 

Theorem 2.1. Imposing the constraint (C3) and in addition possibly the constraints 
(CI) or (C2), the variational principle 

minimize S[F\ on M 

attains its minimum. 

Theorem 2.2. For any parameter v with 

we consider the variational principle 

minimize T[F] + vS[F] on M , (2.9) 

possibly with the additional constraints (CI) or (C2). Then the minimum is attained 
by a function F G L'^{M, 3', dji). 

Theorem 2.3. For any parameter C > 0, we consider the variational principle 

minimize S[F] on Mc := {F G M with T[F] < C} , 

possibly with the additional constraints (CI) or (C2). Then the minimum is attained 
by a function F G Lp'{M, J, dji). 
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Before coming to the proofs, we briefly discuss the variational principles and our 
results. First of all, labeling the eigenvalues as in ()2.ip . the Lagrangian ()2.3p can be 
written as 

^[^^^] = {\^^'\ - \^T\) ■ (2-10) 

In particular, one sees that the Lagrangian is always non-negative. Furthermore, the 
Lagrangian is causal in the sense that it vanishes identically for spacelike separation 
defined as follows. 

Definition 2.4. (causal structure) Two space-time points x,y £ M are called time- 
like separated if the Aj^ are all real. They are said to be spacelike separated if all 
the Aj^, j = 1, . . . ,2n form complex conjugate pairs and all have the same absolute 
value. In all other cases, the points x and y are said to be lightlike separated. 

Theorem 12.11 is an obvious generalization of Theorem 11.21 Theorems 12.21 and 12.31 
are more interesting because the range of the admissible functions F is in general a 
non-compact subset of 9". The constraints (CI), (C2) and/or (C3) are needed in order 
to rule out the trivial minimizer F = 0. Clearly, (C2) implies (CI). To explain the 
assumptions of Theorem 12.21 we note that the inequality 



yields the following upper bound for the Lagrangian ()2.3p . 




C[A,y]< \Axy\^. (2.11) 



1 

2n 

Thus the condition (j2.8p ensures that the functional T[F] + in Theorem 12.21 is 

non-negative. If u < — , this functional is unbounded from below, as the following 
example shows. 

Example 2.5. (Ill-posedness) We consider the case n = 1 and / = 2, and choose M = 
{1,...,4} with the normalized counting measure /^({/}) = \ ^ ^ For 

any A; > we define F G M by 

Then the identity constraint (C2) is satisfied. Moreover, since the matrices A^y are all 
of rank at most one, we know that \A^y\ = \Axy\'^. Thus 

^ ^ x,y£M 



In the case v < — 2n-i i bracket is negative, and thus the functional tends to minus 
infinity as /c — > oo. Hence the variational principle (j2.9p is ill-posed. 
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The remaining border case v = — ^^"-^ will not be considered in this paper. 

To explain Theorem \2.'6\ we first note that according to (j2.1U|) . the action S in 
Theorem 12.31 is non-negative. As will be explained in the next Section 12.21 the con- 
straint T[F] < C is needed, because without an a-priori bound on T there are examples 
of divergent minimizing sequences. We also point out that the we cannot prove The- 
orem [2]3] with the alternative constraint T[F] = C. This is because when taking the 
limit of a minimizing sequence F/. F, the functional T need not converge, and indeed 
we can only prove that T[F] < limk^oo T[Fk]. This so-called "bubbling phenomenon" 
will be explained in Section 12.31 The proofs of Theorems I2.1H2.3I will be completed in 
Section [231 

2.2. Counter Examples to Compactness. We now illustrate in simple counter 
examples why the constraint T[F] < C in Theorem 12.31 is needed. In the first three 
examples, the action S[F] is uniformly bounded for a divergent family of functions F. 
In the last example, we construct an unbounded series of functions F^ : M ^ J which 
is even a minimizing sequence because S[Fk] — > 0. In all these examples we satisfy the 
identity constraint (C2). 

Example 2.6. (Divergent series with hounded action) We let n = 1, / = 2 and 
M = {1, ... 4} with the normalized counting measure. For any given parameter r G M 
we choose the function F : M ^ 3^ hy 

Obviously, these matrices all have at most one positive and one negative eigenvalue. 
Furthermore, the identity constraint (C2) is satisfied. Using that the matrices F{1) 
and F{2) are of rank one, we find 

£[An] = \ |An|2 = 128 = £[^22] , £[^12] = . 

Since the square of the matrix -F(3) is the identity, we obtain £[^33] = 0, and simi- 
larly £[^44] = £[^34] = 0. The matrix F(1)F(3) is nilpotent and and thus £[^13] = 0. 
Similarly, we find that £[^23] = £[^24] = £[^34] = 0. We conclude that 

5[F] = l(£[An]+£[A22]) = 16. 

Hence the action is bounded uniformly in the parameter r, but the matrices F[2>) 
and -F(4) diverge as r — > 00. 

This example suggests that the compactness problem could be removed simply by 
setting the divergent matrices equal to zero. The next example shows that this simple 
procedure does not work, because then the limiting configuration would in general 
violate the constraints. 

Example 2.7. (Violation of the identity constraint) We let n = 1, / = 2 and M = 
{!,... 3} with the normalized counting measure. For a given parameter r > 1 we 
choose the function F : M — > 9" as the following linear combinations of Pauli matrices: 

F[l) = 3 ^^l±l! cT^ + 31 , F(2), F(3) = -- :/I±^ a^±l v^TTT^ . 
T 2 r 2 



CAUSAL VARIATIONAL PRINCIPLES ON MEASURE SPACES 21 

A short calculation shows that these matrices all have at most one positive and one 
negative eigenvalue. Furthermore, the identity constraint (C2) is satisfied. To compute 
the action, we first note that 



F(iy = 9 ^ — 1 + 2-9^^ a'^ and thus £[Au] = r-r 

T 7 - 

Since the squares of the matrices F{2) and F{3) are multiples of the identity, we find 
that ^2(^22] = -^[^33] = 0. Moreover, using the identity for Pauli matrices 

aV = <5^^' + ie*^'^ a^ (2.12) 



we obtain 



Ai2 = cl-- a^ + ^Vl + r^(r-^ 

It I It 



with c G M. A short calculation using the anti-commutation relations for the Pauli 
matrices yields (yli2 — cf' = 0, and thus the matrix A12 has the eigenvalue c, with 
algebraic multiplicity two. Using the notion of Definition 12.41 the points 1 and 2 have 
spacelike separation, so that £[^12] = 0. Similarly one verifies that £[j4i3] = £[^23] = 
0. We conclude that 

1 1 + 

S[F] = -C[Au] = 72 



9-— - r2 ■ 
Hence in this example the action is bounded uniformly in r, although the matri- 
ces F(2) and F{3) diverge as r — > 00. Setting these two matrices to zero, the remaining 
matrix F{1) does converge, 

lim F(l) =3{a^ + l) , 

r— >oo 

but the limiting system no longer satisfies the identity constraint (C2). 

Example 2.8. (The two-dimensional Dirac sphere) We let M = C with d/i the 
surface area measure, normalized such that fJ-iS"^) = 1. Furthermore, we choose n = 1 
and / = 2. For a given parameter r > 1 we introduce the mapping F : M ^ 9' hy 

F{x)=Txa + l. (2.13) 

Then a{F{x)) = {1 -|-r, 1 — r}, and thus F{x) has one positive and one negative eigen- 
value. Furthermore, a symmetry argument shows that the identity constraint (C2) is 
satisfied. Using the Pauli identities ()2.12p . one obtains 

F{x) F{y) = {1 + t"^ xy) 1 + T {x + y)a + iT'^ {x A y)a. 

A straightforward calculation yields for the eigenvalues of this matrix 



^1/2 = 1 + COS ± rVl + cos ^2 - r2 (1 - cos i?) , (2.14) 

where "& denotes the angle "d between x and y. If i) is sufficiently small, the term (1 — 
cos I?) is close to zero, and thus the arguments of the square roots are all positive. 
However, if becomes so large that 

/ 2 
> t?max := arccosi 1 - 

the argument of the last square root in (j2.14p becomes negative, so that the X1/2 form 
a complex conjugate pair. The calculation 

A1A2 = det(F(f)F(y)) = det(F(f)) det(F(y)) = (1 + t)\1 - t^ > 
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shows that if the are both real, then they have the same sign. Hence the Lagrangian 
simphfies to 

L{A,y\ = £(cosi?) = e(i?„,ax - ^) 

= (1 + cos r) (2 - (1 - cos i?)) G(T?max - f?) • 
Using this formula in (12.4p . we can carry out the integrals to obtain 

S[F] = - £(cos??) siniJdi? = 4 (2-15) 

Similarly, the functional T can be computed to be 

T[F]=Ar\T^ -2) + 12- ^ . (2.16) 

Hence the action (|2.15p is bounded uniformly in r, although the function F, ()2.13p . as 
well as the functional T, (j2.16p . diverge as r — > oo. 

We remark that this example can be extended to the case of general even / by decom- 
posing C-^ as a direct sum of //2 copies of C2, choosing M = 5^ x {1, . . . , f/2} and 
setting 

F : S'^x{l,...,f/2} {C^)2 : (f, i) ^ 00 • • • ©00 (r fa + 1) ©0© • • • 00 . (2.17) 

^ V ' 

i"^ summand 

Example 2.9. (The three-dimensional Dirac sphere) This example can be regarded 
as an analog of Example 12.81 but in one dimension higher. We introduce the four 
4 X 4-matrices 



\ ^ „ „ , 4/0 1 

These are the Dirac matrices of Euclidean M^, satisfying the anti-commutation relations 

{f,-f^} = 25^^1 (z,j = l,...,4). 

As is verified either by direct computation or by applying the general theory of Clifford 
representations, the Dirac matrices are S0(4)-invariant in the sense that for every 
rotation R E SO (4), there is a unitary matrix U G SU(4) such that 

Uj^U-^ = rIj^ . (2.18) 

We let / = 4, n = 2 and set M = 5^ . We introduce the mapping F : M ^ 3" by 

4 

F{x) = ^rxY + 1 • 
1=1 

These matrices are S0(4)-invariant in the sense that 

F{Rx) = UF{x)U-'^ , 

where R and U are the transformations in ()2.18p . Since the unitary transformation U 
does not affect the eigenvalues of A^y, we see that >C[Aa;y] = C[AiixRy\- Thus the 
Lagrangian will depend only on the angle '& between the vectors x and y. Further- 
more, it suffices to compute the Lagrangian for vectors x and y for which the zero 
component vanishes. But in this case, the eigenvalues of A^y are calculated exactly as 



CAUSAL VARIATIONAL PRINCIPLES ON MEASURE SPACES 



23 



in Example 12.81 above. Thus the eigenvalues are again given by (j2.14p . but now each 
eigenvalue appears with algebraic multiplicity two. We conclude that 

C[A^y\ = £(cosi?) = (Ai - A2)' e(i?„,ax - ^) 

= 4r2 (1 + cos r) (2 - (1 - cos ^)) - i?) . 

Inserting this Lagrangian in (j2.4p . 



2 



vr 



max 



S[F] = - / £(cos??) sin^^d^ 







we obtain the same integral as in (j2.15|) . except that the integrand contains an addi- 
tional factor sint?. Due to this extra factor, the action decays for large r, 

S[F] = ^ + 0{r-') . 
lovr T 

Thus setting Fk = F\T-=k, we have constructed a divergent minimizing sequence. 

We finally remark that this example generalizes similar to (j2.17p to larger /, provided 
that / is divisible by four. 

2.3. The Moment Measures and the Possibility of Bubbling. In this section 
we discuss the main difficulties in proving Theorems 12.21 and 12. 3| and we will explain 
the methods for resolving these difficulties. The obvious starting point is a minimizing 
sequence F^ G M. In the setting of Theorem l2.31 we know that T is uniformly bounded, 

T[Fk] < C for ah G N. (2.19) 



In the setting of Theorem 12.21 the estimate (j2.1ip shows that (j2.19p again holds if we 
set 

C ' 



1 + mm{u, 0) ( 1 - ^ 



max(r[Ffcl + i^SlFk]) < 00 . 

keN 



In view of the freedom to act by isomorphisms of the measure space (M, /x) discussed 
after ()1.4p . it is preferable to work again instead of the mapping F with the correspond- 
ing measure p on 9" as defined by ()1.6p . Similar to ()1.7p , we can write the functionals S 
and T in terms of p, 



Sip) = If C\pq] dp{p) dp{q) , Tip) \pq\'' dp{p) dp{q) . (2. 



20) 



Moreover, the identity 



/ F{x)dpi{x)= / pdp{p) (2.21) 



allows us to also express the constraints (CI) and (C2) in terms of p. 

The main complication compared to the setting of Chapter [1] is that 3" is no longer 
compact, and thus we need to control the support of the measures pk in order to ensure 
that the limiting measure p = lim^^oo Pk again has total volume one. Moreover, it 
is no longer obvious that the functionals in ()2.20p or the integral ()2.2ip converge in 
the limit k 00. At this point, it is helpful to observe that the integrand in (j2.2ip 
is homogeneous in p of degree one, whereas the integrands in (12.20p are homogeneous 
of degree two in both p and q. This allows us to express these functions in terms of 
so-called moment measures, which we now introduce. 
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Definition 2.10. Let IC be the compact manifold 

/C = {p G 3" with \\p\\ = 1} U {0} . 

For a given measure p on 9' we define the measurable sets of IC by the requirement 
that the sets M+O = {Xp \ A G G 0} and M~J7 should be p-measurable in 3". We 

introduce the measures m^^^ , m^^-* and m^'^^ by 

m(°) (n) = i p{Rn \ {0}) + p{n n {o}) (2.22) 

m^'\n) = \j \\p\\ dp{p) - \f \\p\\ dp{p) (2.23) 
m(2)(17) = i / . (2.24) 



2 

T/ie measure m^^^ is referred to as the l^^ moment measure. 

In terms of the moment measures, the normahzation p{3^) = 1 becomes 

m(°)(/C) = l, (2.25) 
whereas the relations (|2.20p and ()2.2ip can be written as 

S{p)=if C[pq]dxvP'\p)dm^'^\q) (2.26) 

J JkxK. 

T{p)= [[ |pg|2dm(2)(p)(im(2)(g) (2.27) 
J JjCxK 

/ pdm(^)= / pdp= / F{x)dp{x). (2.28) 

Working with the m^'-* has the advantage that they are measures on the compact 
space IC. We also learn that two measures p and p whose moment measures coincide 
yield the same values for the functionals S and T as well as for the integral (j2.28p 
entering the constraints. Therefore, it is useful to consider two measures as being 
equivalent if their moment measures m^'^^ , m^^'^ and m^^^ coincide. Using this notion, 
we have a large freedom to modify the measures pk within the equivalence class defined 
by its moment measures (m^'^^ , m[,^^ , m^^'' ) . This freedom is indeed a problem for proving 
convergence of the measures pk, as the following one-dimensional analog shows. 

Example 2.11. (Discontinuous moments) We consider on M the family of measures 

Pr = -6o + ^ 6i + -^^6r , r > 1 , (2.29) 
r T — 1 — T 

where 5x denotes the Dirac measure supported at x. These measures all have the same 
moments 

Pr{R) = 1 , X dpr{x) = 1 , / dprix) = 4 , (2.30) 

Jr Jr 

and are thus all equivalent in the above sense. 
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In the limit r ^ oo, the measures pr converge in the C'^(M)*-topologj{3 to the 
measure 6i{x) dx, having the moments 



1, x6i{x)dx = l, I x^ 5i{x) dx = 1 . 

Jr Jr 



Hence the total volume is preserved in the limit, and also the first moment is contin- 
uous. But the second moment jumps discontinuously as r — > oo. 

This example reveals the undesirable fact that the limit p = lim^ p^ in the C^{3')*- 
topology may depend on how the representatives pk of the corresponding moment 
measures {m^i^\m^f}\m^^^) are chosen. In order to bypass this problem, we shall work 
exclusively with the moment measures. At the very end, we shall then construct a 
suitable representative p of the limiting moment measures. A key step for making this 
method work is the following a-priori estimate. 

Lemma 2.12. There is a constant e = £{f,n) > such that for every measure p 
on 3" the corresponding moment measures (see Definition \2. 1(J\) satisfy for all measur- 
able 0, C IC the following inequalities: 

m(i) {Q) ^ < m(°) (O) m(2) (Q) (2.31) 



^(2)(^)< (2.32) 

Proof. The inequality (j2.3ip follows immediately from Holder's inequality, 

2m^^\n) ^ <( [ \\p\\ dp{p)) < p{Rn) f \\pf dp{p) < 4m(°)(0) m^^\n) . 

To prove (|2.32p . we introduce the mapping 

(j) ■ /C X /C ^ M : {p,q) ^ \pq\ ■ 
Clearly, cj) is continuous and 

(j){p,p) = Ip^I = Tr(p^) = IIpIP = 1 . 
Thus every point r £ fC has a neighborhood U{r) C /C with 

Hp, all P,i^ f^(^) • (2-33) 

Since /C is compact, there is a finite number of points ri, . . . ,rAr such that the corre- 
sponding sets Ui := U{ri) cover /C. Due to the additivity property of measures, there 
is an index i S {1, . . . , N} such that 

m(^)(f/.)>^. (2.34) 
We write T in the form (j2.27p and apply (j2.33p as well as (|2.34p to obtain 



Tip) > [[ \pq\'dm^'Hp)dm('Hq) >lm('Hu.f > 



2^m(2)(/C)^ 



Setting e = 1/{V2N), the result follows. □ 



-'^For clarity we point out that by C° we always mean the closure of the compactly supported 
continuos functions with respect to the sup- norm. Thus C''(R) is the space of continuous functions / 
with limi^i^oo I/I = 0. 
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In view of this lemma and the a-priori bound (j2.19p . we know that the moment 
measures are uniformly bounded measures on a compact space K. Thus, exactly as in 
Section 11.21 we can apply the Banach-Alaoglu theorem and the Riesz representation 
theorem to conclude that for a suitable subsequence of the (which for simplicity 
we denote again by Fk), these measures converge in the C°(/C)*-topology to positive 
regular Borel measures, 

m«^mW (/G {0,1,2}), 

which again have the properties (|2.25p . (|2.3ip and (j2.32p . 

We next consider the Radon-Nikodym decompositions of m'^^-' and m^^^ with respect 
to m(°) (cf. [M Section 31]), 

dm(') = dm(°) + dm^ig with e L^(/C, dm^^^) {I = 1, 2) , 

where the measures c^trigl^g are singular with respect to dxn^^\ Evaluating (j2.3ip for 
any Q in the support of drrigj^jlg, the right side vanishes, and thus rrigj^^g = 0. Fur- 
thermore, the inequality (j2.3ip implies that l/'-^-'P < f^"^^- In particular, we conclude 
that /(^) even lies in L2(/C, (im(°)). Setting / = /(i) and dn^^) = (/(2) _ |/|2)dm(2) + 

(2) 

drrigjj^g, we obtain the decomposition 

(im(^) = /dm(°) , dm(2) = \f\^dm^^^ + dn , (2.35) 

where / e L2(/C, dm(°)), and n is a positive measure which need not be absolutely 
continuous with respect to m^^\ From the definition (j2.23p it is clear that / is odd in 
the sense that 

f{-p) = -f{p) for ah p G /C . (2.36) 

The remaining task is to represent the limiting moment measures m'-'^ in (j2.35p by 
a function G M and a corresponding measure p on 3'. Unfortunately, there is the 
basic problem that such a measure p can exist only if m^^) is absolutely continuous with 
respect to m^"-', as the following consideration shows. Assume conversely that m(2) is 
not absolutely continuous with respect to m^'^^ . Then there is a measurable set i7 C /C 
with m^^\lC) = and m^'^\lC) ^ 0. Assume furthermore that there is a measure p 
on 9^ which represents the limiting moment measures in the sense that (I2.22p - (12.24p 
hold. From (j2.22p we conclude that the set C 3^ has /o-measure zero. But then the 
integral ()2.24p also vanishes, a contradiction. 

This problem can also be understood in terms of the limiting sequence pk- We cannot 
exclude that there is a star-shaped region Mil C 9" such that the measures /9fc(Mr2) tend 
to zero, but the corresponding moment integrals (j2.24p have a non-zero limit. Using 
a notion from the calculus of variations for curvature functionals, we refer to this 
phenomenon as the possibility of bubbling. This bubbling effect is illustrated by the 
following example. 

Example 2.13. (Bubbling) We choose / = 2, n = 1 and again use the Pauli rep- 
resentation (I1.16P to identify IF with 5^. Furthermore, we let M = [0, 1) with p, the 
Lebesgue measure. For any parameters k > and e G (0, ^), we introduce the function 
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: M ^ 3- by 



-KE 2 cr 



^ if X < £ 



^^(x) = X < 1 + fj"'^ cos{kx) + sm{kx) ife<x<l— e 

KS 2 a iix>l — e. 

The corresponding measure ps on 3~ has the fohowing properties. On the set 

5 := {1 + wV^ + v^a^ with + {v'^f = 1} , 

which can be identified with a circle S^, pe is a multiple of the Lebesgue measure. 
Moreover, pe is supported at the two points 

_ 1 

P±:=±ff^'^' with p{{p^]) = p{{p^]) = e . (2.37) 

A short calculation shows that the identity constraint (C2) is satisfied. Furthermore, 
the separations of the points p+ and p- from each other and from S are either spacelike 
or just in the boundary case between spacelike and timelike. Thus for computing the 
action, we only need to take into account pairs (x, y) of points on S. A straightforward 
computation yields 

^/ ^ 3 ^, , 6 16^2 16k^ 

Let us consider the limit e \ 0. From (12.380 we see that the functionals S and T 
converge, 

lim5 = 3, limT = 6 + 16(K2 + K^). (2.39) 

Moreover, there are clearly no convergence problems on the set S. Thus it remains 
to consider the situation at the two points p±, ()2.37p . which move to infinity as e 
tends to zero. These two points enter the moment measures only at the corresponding 
normalized points p± = p±/\\p±\\ £ fC. A short calculation shows that the limiting 
moment measures m^'-' = lim^^o satisfy the relations 

m(0) ({p±}) = m(i) {{p±}) = but m(2) ({p±}) = > . 

Hence m(2) is indeed not absolutely continuous with respect to m*^ ^ 

In order to clarify the connection to earlier examples, we point out that, in contrast 
to Examples I2.6l and l2.7l the "bubbles" have p-measure zero, so that the total measure 
is preserved in the limit. Nevertheless, the bubbles carry non-zero second moments. 
In Example 12.111 the situation is simpler because the moments ()2.30p are independent 
of r, whereas in the present example the singularity of the moment measure appears 
only in the limit e \ 0. In particular, the moment measures (12.30p can be represented 
by a measure p (for example by choosing p according to (|2.29p with r = 2), whereas in 
the present example the corresponding representation (12.22p - (j2.24[) cannot be given. 

To avoid misunderstandings, we also point out that this example does not show 
that bubbling really occurs for minimizing sequences, because we do not know whether 
the family (-F£)o<£<i/2 is minimizing. But at least, our example shows that bubbling 
makes it possible to arrange arbitrary large values of T, without increasing the action S 
(see ()2.39p for large k). In particular, for large prescribed T, the action here is strictly 
smaller than in Example [23] (cf. (pT5|) and ([2T6]) ). 
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In order to handle possible bubbling phenomena, it is important to observe that 
the second moment measure does not enter the constraints (CI) or (C2) (see (I2.2ip 
and (j2.28p ). Therefore, by taking out the term dn in (12.35[) we decrease the function- 
als S and T (see (I2.26P and (j2.27p ). without affecting the constraints (CI) or (C2). 
The following lemma allows us to remove the term dn in (I2.35[) . 

Lemma 2.14. For every function F G M having the moment measures (I2.35P there 
is a function F € M whose moment measures m^'^ are given by 



^(0) = ^(0) 



dm(i) = / dm(o) , dm(2) = |/|2 dm(°) . (2.40) 



Proof. We first introduce the projection 



if p = 



For the function / in (I2.35P we introduce the set IC-^ C 9" and the corresponding 
projection tt-^ by 

)Cf = {f{p)p with peK,} 
TT^ : J^/C-^ : p^ f{TTK.{p)) T^icip) ■ 
These definitions are illustrated in Figure [3j Finally, we introduce the function F by 

F:M^/C^CJ:x^ TTf{F{x)) . (2.41) 
We decompose any measurable set C /C as $7 = OoU$7+ur2_, where 

no = {pe^}\ f{p) = 0} , n+ = {pe^\ f{p) > 0} , n^ = {pen\ f{p) < 0} . 

Since / is odd (|2.36p . we know that /(O) = and thus G J^o- Using the definition 
of F, (ITITD . it follows that 

pifto) = p({0}) , p(M+f]+) = p{n+) , p{R+n^) = . 
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Moreover, using (j2.22p together with the fact that the measure p is supported on K,^ , 
we find 

/ \\p\Up{p) = 2[ \f{p)\' dm^^Hp) (Zg{0,1,2}). 

jR+n+ Jn+ 

Similarly, 

\\p\\'dpip)=2 [ \f{p)\Um^^\p) . 
Using these relations, we can compute the moment measures corresponding to p by 
mW(0) = ^ p{Rn+) + ^ p(Mf]_) + p{no) 

= m(°) {n+) + m(°) (n^) + m(°) (Oq) = m(°) (0) 
m('^(f^) = ^ / Mdpip)-^ I Mdpip) 



/(p) dm(0) (p) + f(p) cim(o) (p) = / f{p) dm(0) (p) 
n+ Jq- Jn+uQ- 

rh^'H^) = \ I \\pfdp{p) = f dmW(p) . 

This completes the proof. □ 

Applying this lemma to our minimizing sequence, we shall obtain a new minimizing 
sequence Fk- The next lemma shows that the limiting moment measures of this new 
sequence can indeed be represented by a function F £ M. 

Lemma 2.15. Let F^ £ M. be a sequence of functions such that the corresponding 
moment measures xn^i^\ I G {0, 1, 2}, converge in the (JC)* -topology to moment mea- 
sures m^'^ having the Radon- Nikodym representation 

dm(i) = /dm(°) , dm(2) = |/|2 dm(°) (2.42) 

with f e L2(/C,(im(°)). Then there is a function F G L?'{M,3^ ,dp) such that the 
corresponding moment measures (as defined by ()2.22p - ()2.24p and ()1.6p ) coincide with 
the moment measures in (I2.42p . 

Proof. If the moment measures corresponding to a function F G M coincide with (j2.42p . 
it follows from Definition 12.101 that 

\Ffdp = l^llpf dp(p)=m(2)(/C) = ||/||2,(^^^^(o)^<oo, 

proving that the function is square integrable, F G L^{M,9',dp). 

Using the same notation as in the proof of Lemma 12.141 we define the measure p 
on 9" by 

p({0}) =m(°)({0}) and = 2m(°) (vrclC/ n /C^)) if C/. 

Similar as in the proof of Lemma 12.141 one verifies that the moment measures corre- 
sponding to p indeed coincide with the moment measures in (j2.42p . Thus it remains 
to represent p by a function G M. 

As in the proof of Lemma 11.41 we decompose the measure space (M, p) into an 
atomic part {M'^,p'^) and a non-atomic part {M^,p'^) (see (jl.lOp ). By modifying the 
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mappings -F^ on sets of measure zero we can again arrange that for every atom A G 21, 
the set Fk{A) consist of only one point. The estimate 

\\Fk{A)ffi'{A)< [ \\Fk{x)fdf, = mf\}C)^xn^^\)C) (2.43) 

shows that for any ^ G 21, the sequence F/^^A) is bounded in IF. Thus, just as in the 
proof of Lemma [131 there is a subsequence of the such that the sequence Fk{A) 
converges for every ^ S 21. Again subtracting the resulting limit measure according 
to ()1.12p . we obtain a positive p^. Thus it remains to construct a function g : 3^ 
which satisfies (jl.lSp . 

The method of Lemma fL^ does not immediately apply because the support of p does 
not need to be bounded. But we can use the following standard exhaustion argument. 
We introduce the sets IFl, L G No by 

g-^ = {p G :j with L < \\p\\ < L + 1} . 

Clearly, the 3'l form a partition of 9". Using (jl.lip . we can iteratively construct a 
partition Ml of M'^ such that p^{Ml) = p{S^l)- Exactly as in the proof of Lemma [1.41 
we can construct a functions ql '■ Ml — > 7l with the property 

p''{Q) = p.''{gY}{Q)) for every Borel set r2 C 3"l • 

Then the function g defined by g\ML = 9l has the required property (jl.lSp . □ 

2.4. Existence Proofs. We can now complete the proofs of the existence theorems 
stated in Section 12.11 Theorem 12.11 can be proved with the same methods as Theo- 
rem [T21 

Proof of Theorem \2.1[ The set 3^(C3) of matrices satisfying the constraint (C3) is a 
compact manifold. For a given minimizing sequence (-Ffc)fcGN the corresponding mea- 
sures pk defined by (|1.6p are supported on 9"(C3) • Exactly as in the proof of Lemma [L3l 
a subsequence of the pk converges in the {3' (^c^-^)* -topology to a positive normalized 
Borel measure on 9"(C3)- Since the integrands in (|2.20p and (j2.2ip are continuous 
in p and q, the action converges and the constraints (CI) or (C2) (if considered) are 
preserved in the limit. As in the proof of Lemma 11.41 we finally represent p by a 
function F G M. □ 

For the remaining proofs we need to use the methods of Section 12.31 

Proof of Theorems \2.S\ and \2.3[ Let {Fk)ke'N be a minimizing sequence. According to 
Lemma [2T31 to any F^ we can associate a function Fk G M. Using (|2.40p and p.35p 
in ([52B|)-([22SD, we see that 

S[Fk]<S[Fk], T[Fk]<T[Fk] and / Ffc(x) d/x(x) = / Fkix) dp{x) . 

Jm Jm 

Hence replacing the F^ by the F^, we obtain a new minimizing sequence which still 
satisfies all the constraints. As in the proof of Lemma [1.3[ the Banach-Alaoglu theorem 
and the Riesz representation theorem yield that for a subsequence of the Fk (denoted 
again by F^), the corresponding moment measures converge in the C'^(/C) *-topology 
to bounded regular Borel measures m*^'\ / G {0, 1,2}. The property (j2.40p of the F^ 
yields that the measures m*-^-' and m^^^ have the Radon-Nikodym representation (|2.42p 
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with / G L^{IC,dm^'^^). According to Lemma 12.151 we can represent the moment 
measures by a function F G L'^{M, 9", dfi), being the desired minimizer. □ 

2.5. Remarks and Open Problems. We conclude this chapter by a few remarks and 
a brief discussion of open problems. We first note that Theorems 12.21 and 12.31 remain 
valid if other constraints are imposed. For example, one could prescribe only some of 
the 2n non-trivial eigenvalues of F{x), or one could prescribe the trace of F{x). Such 
generalizations are straightforward and shall not be considered here. More interesting 
are the following open problems: 

(A) It is not clear whether Theorem 12.31 remains valid if the constraint T[F] < C 
is replaced by T[F] = C. In other words, does the bubbling phenomenon dis- 
cussed in Section [2.31 really occur for minimal sequences, or does it only reflect 
a shortcoming of our method of proof? We again point out that Example 12.131 
does not prove bubbling, because it is unknown whether the constructed fam- 
ily F^ is minimal. 

(B) It is an open problem whether Theorem l2.3l holds without the constraint T[F] < 
C if one assumes that M is a finite set and ^ the counting measure. We point 
out that the counter Example 12.91 works only for a continuous measure, but 
the situation for discrete measures is unclear. A partial result in this direc- 
tion was obtained in [6l Theorem 2.9], where an analog of Theorem 12.31 was 
proved without the constraint T\F] < C for M a finite set and jJL the counting 
measure, assuming in addition the trace constraint (CI) (see also Theorem 13.31 
below and the corresponding statement for the local correlation matrices in 
Section [3.2p . But it is an open (and seemingly difficult) problem to prove the 
same for the identity constraint (C2). 

(C) Treating the constraint in the variational principle of Theorem 12.31 with the 
Lagrange multiplier method, one finds that every minimizer F is a critical 
point of the functional S — kT for a suitable Lagrange multiplier k G M. It is 
not clear what the value of the Lagrange multiplier is, nor how it depends on 
the parameter C . What is the range of the Lagrange multipliers if C varies 
over M+? Of particular interest are the negative values of k, because this case 
cannot be handled using Theorem 12.21 

(D) Almost nothing is known about uniqueness. Clearly, we have the freedom to 
perform isomorphisms of the measure space M as well as unitary transforma- 
tions of C-^. Moreover, we are free to change the function F as long as the 
moment measures remain unchanged. But are the minimizers unique up to 
these obvious transformations, at least if is a non- atomic measure? 

(E) In the setting of Section 11.51 we saw examples of non-trivial minimizers. Do 
distributional minimizers as in Example 11.111 also exist if /? > 0? Are the 
minimizers in the setting of Lemma 11.121 or Lemma 11.131 smooth? 

(F) It would be interesting to know more about the regularity of the minimizers in 
the general setting of Chapter [2l Is every minimizing measure p supported on 
a set K,^ C 3"? Under which conditions is the set K,^ a submanifold of KP. If 
yes, in which situations is p absolutely continuous with respect to the Lebesgue 
measure on /C? When is the corresponding Radon-Nikodym derivative a con- 
tinuous or even smooth function on K,^1 

Answering these questions goes beyond the scope of this paper. We now proceed to 
the applications. 



32 



F. FINSTER 



3. Variational Principles in Indefinite Inner Product Spaces 

Let us briefly outline the physical context in which causal variational principles 
arise (for details see fS] or the review articles [HE]). The Dirac wave functions in 
Minkowski space (M, (.,.)) (or more generally on a Lorentzian manifold) have four 
complex components, and they are endowed with an indeflnite inner product ^Vl'?^^ of 
signature (2,2) (this inner product is usually written as ^tl^\(j)>- = ijjtp with -0 = ip^'y'^ 
the so-called adjoint spinor and 7" = diag(l, 1, —1, —1)). Integrating this inner product 
over space-time defines an inner product on the wave functions, 



Jm 

A system of Dirac particles can be described by an operator P which is built up of an 
ensemble of wave functions, 



where the index a runs over all the quantum numbers of the occupied states of the 
system. The operator P is referred to as the fermionic projector or, more generally, 
the fermionic operator. In [3] it was proposed to formulate the physical equations in 
terms of a variational principle for the fermionic operator in space-time. It is a main 
advantage of this approach that the metric, causal and even topological structure of 
space-time does not enter the variational principle. This makes it possible to formulate 
the physical equations on a set of points, referred to as discrete space-time. As a 
consequence of a spontaneous symmetry breaking effect [5], the fermionic operator 
induces additional structures on the space-time points, and there is some evidence 
that for systems involving many particles and space-time points, these structures give 
rise to the usual topological and causal structure of the space-time continuum [7]. 
Moreover, our variational principle can be analyzed in Minkowski space in the so- 
called continuum limit. In view of these developments, it is of interest to analyze the 
variational principle both in the discrete and continuous settings. In order to treat 
these two cases in a unified setting, M is best desribed by a general measure space. 

Analyzing variational principles in the general setting ()3.ip and ()3.2I) leads to two 
convergence problems: First, the space-time integral (13. ip need not be finite, and 
secondly the sum in (13. 2p might diverge. In order to avoid these problems, we shall 
assume that M has finite volume. Furthermore, we make the sum in (j3.2p finite by 
considering only a finite number of particles. In this chapter we shall give a general 
existence proof in finite volume and for a finite number of particles. In Chapter SI we 
will prove existence for homogeneous systems even in infinite volume for an infinite 
number of particles, assuming merely a momentum cutoff. 

Before introducing the general setup and stating our results, we mention one parti- 
cular difficulty in analyzing variational principles in indefinite inner product spaces. 
The inner product (j3.ip and also the action of any physically reasonable variational 
principle are invariant under transformations of the wave functions of the form 







a 





(3.3) 



referred to as local gauge transformations. Since the group U(2, 2) is non-compact, 
there is a large freedom to change the wave functions without affecting the physical 
action. 
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3.1. Definitions and Statement of Results. Let (M, fi) be a measure space of total 
volume /x(M) = 1 and {V, -<.\.y) a 2n-dimensional complex vector space, endowed with 
a non-degenerate sesquilinear form of signature (n, n) (for basic definitions see [9]). We 
consider the vector space H = L'^{M, V, dfi) of square-integrable functions from M to ^ 
and introduce on M the sesquilinear form 

<'4,\(j)> = / ^^l;{x)\(l){x)>- dfi{x) . (3.4) 
Jm 

Thus (if, <.|.>) is an indefinite inner product space. A function £ H has 2n 
components and is called a wave function. The parameter n is referred to as the spin 
dimension. We introduce the fermionic operator P : H ^ H as an operator having 
the following properties (A) and (Bl) or (B2): 

(A) P is symmetric in the sense that 

<PVI'/'> = <'tp\P4>> for al\ (j),^!; £ H . 

(Bl) The operator P has finite rank, dim P{H) < f < oo. It satisfies the trace 
constraint 

tr(P) = / 

(where "tr" denotes the trace of operators in H). Furthermore, the opera- 
tor {—P) is positive in the sense that 

Kipl {-P)iJ> > for ah e IK . 

(B2) The operator P is a projector on a negative definite subspace of H of dimen- 
sion f < oo. 

In case (B2), the operator P is called fermionic projector. In this case, the calculations 

tr(P) = rank(P) = / , <V'|(-P)V'> = - <V'I^V> = - <PV'I^^> > 

show that condition (Bl) is again satisfied. Thus (B2) is a special case of (Bl). 
We refer to the rank of P as the number of particles. Note that for a fermionic 
projector, the number of particles is equal to the parameter /, whereas in case (Bl) 
the number of particles is only bounded from above by /. For brevity, we refer to the 
system {P, H,<.\.>) as a fermion system, and M are the space-time points. 

Since P has finite rank and <.|.> is non-degenerate, we can clearly represent P by 
a finite matrix, 

/ 

p^= J2'^k <</'/|V'> with V'fc, G H . 

k,l=l 

Writing the inner product with an integral (|3.4p . one sees that P has an integral 
representation, 

{P^P){x)= [ P{x,y)i^{y)dii{y) with P{x,y) £ L^{M x M,L{V)) (3.5) 

JM 

(where L(y) denotes the space of linear mappings from V to itself). We refer to P{x, y) 
as the kernel of the fermionic operator. Next, we introduce the closed chain A^y by 



A^y = P{x,y)P{y,x) 



(3.6) 
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Note that, for any fixed x and y, the closed chain is a symmetric linear mapping of V 
to itself. More generally, for any endomorphism ^ of y we define 

2n 

the spectral weight |^| = |A/| , (3-7) 

1=1 

where A/ are the eigenvalues of A, counted with algebraic multiplicities. We introduce 



2 I ^ \ A \2 



the Lagrangian C[An,y\ = \A^y \ - — \Axy 



(3.8) 



and define the functionals S and T by 



S[P] = / / C[A^y] dfi{x) dfi{y) (3.9) 

J J MxM 

T[P] =11 \A^y\^ dfiix) dfiiy) . (3.10) 

J J MxM 



These definitions are similar to ()2.3p - ()2.5p . However, we point out that now the 
mathematical setting is very different; in particular the functionals now depend on the 
fermionic operator P. Again using Definition 12.41 the eigenvalues Aj^ of the closed 
chain Axy induce on M a causal structure. Now we even get a connection to the usual 
notion of causality: If the fermionic operator describes vacuum Dirac seas, the causal 
structure of Definition 12.41 coincides precisely with the causal structure of Minkowski 
space (for details see [7, Section 6]). 

We prove the following existence results. 

Theorem 3.1. For given parameters /, n G N with f > 2n and a parameter v in the 
range (|2.8|) . we consider the variational principle 

minimize T[P] + zv5[P] 

within the class of all fermionic operators with the properties (A) and either (Bl) 
or (B2). Then the minimum is attained. 

Theorem 3.2. For given parameters fjUG'N with f > 2n and a constant C > 0, we 
consider the variational principle 

minimize S[P] on Vc ■= {P with T[P] < C} 

within the class of all fermionic operators with the properties (A) and either (Bl) 
or (B2). Then the minimum is attained. 

The method of proof is to consider the corresponding local correlation matrices (see 
Section [2^3]) ■ making it possible to apply the results of Chapter[2j In this reformulation, 
the conditions (Bl) and (B2) will correspond precisely to the conditions (CI) and (C2) 
on page [HI respectively. 

Before entering these constructions, we briefiy discuss the above theorems and put 
them in the context of previous work. In [6] similar existence results were obtained in 
the special case where M is a finite set and ^ the counting measure. These assump- 
tions are a major simplification because then the vector space H is finite-dimensional. 
The remaining difficulty is to handle the non-compact gauge freedom (13. 3p . and this 
problem is overcome by suitable a-priori estimates and a gauge fixing procedure. In 
order to avoid confusion of notation, we point out that in f6] the Lagrangian (j3.8p 
is referred to as the critical Lagrangian. Also, instead of minimizing the functional 
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T[P] + z^5[P], in [6] the action corresponding to the Lagrangian = — /i|j4p is 
considered for A* > These different functionals can easily be transformed into each 
other, and one sees that the procedure in [6] is equivalent to minimizing T[P] + i'S[P] 
for any u > 0. The procedure in the present paper has the advantage that we can also 
consider negative values of u. 

There are a few general differences between the results in [6] and the above theorems. 
First, in [6] the analog of Theorem 13.21 is proved under the constraint T[P] = C 
(instead of < C). This can be understood from the fact that for a finite counting 
measure, the bubbling phenomenon of Section 12.31 cannot appear (as is obvious from 
the inequality ()2.43p for atoms of our measure). Furthermore, it is worth noting that 
the following result for a finite counting measure (see [6l Theorems 2.5 and 2.9]) does 
not carry over to continuous measures. 

Theorem 3.3. Suppose that fi is a finite counting measure. Then the variational 
principle 

minimize S[P] 

attains its minimum if considered in one of the following two situations: 

(1) The fermionic operator satisfies (A) and (Bl). 

(2) The fermionic operator satisfies (A) and (B2), and P is homogeneous in the 
sense that Tr(P(x,x)) = Tr{P{y,y)) for all x,y £ M. 

The three-dimensional Dirac sphere (Example 12. 9p shows that this theorem does not 
hold for continuous measures, because there are divergent minimal sequences. Thus 
there are situations where minimizers exist only due to the discreteness of space-time. 

We finally point out that in [2j examples of minimizers are constructed for a small 
number of particles and space-time points, and the resulting causal structure is dis- 
cussed. 

3.2. Reformulation in Terms of Local Correlation Matrices. A direct ap- 
proach to proving Theorems 13.11 and 13.21 seems difficult because the inner product 
space {H, < . | . > ) is infinite-dimensional. In such infinite-dimensional indefinite inner 
product spaces (also called Krein spaces, see for example [1]), the functional analytic 
methods are quite limited, making it hard to control the behavior of minimizing se- 
quences (Pfe) of our variational principles. A particular problem in this setting is the 
above-mentioned gauge freedom (j3.3p . In order to bypass these difficulties, we shall 
proceed differently, making essential use of the fact that the operator P has finite rank. 
More precisely, our method is to choose convenient generators ipi, . . . ,ipf of the image 
of P and to consider the so-called local correlation matrices defined by 

{F^)i = -<i'j{x)\Mxy ■ (3.11) 

We shall see that the functionals S and T as well as the constraints (Bl) and (B2) 
can be reformulated purely in terms of the local correlation matrices. This remarkable 
fact will make it possible to apply the results of Chapter [2l giving us minimizing 
local correlation matrices. In Section [3.31 we will conclude the proof by constructing a 
corresponding fermionic operator. 

We first specify which generators "01) • • • , "0/ of P{H) we want to use in (|3.1ip . 
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Lemma 3.4. There are vectors tpi, . . . ,tpf G H such that 

f 

PiP = -^il^iKiPiltpy forallijeH. (3.12) 
1=1 

Proof. It is convenient to work in V with a fixed pseudo-orthonormal basis (ej) where 
the inner product has the standard representation with a signature matrix S, 

-<u\v>- = {u, Sv)^2! where S" = diag(l, . . . , 1, — 1, . . . , — 1) . (3.13) 

n times n times 

The signature matrix can be regarded as an operator on V . Alternatively, we can 
consider 5 as an operator on acting by pointwise multiplication (i.e. {Stp){x) := 
S{'il^{x))). This makes it possible to represent the inner product on H in terms of the 
standard scalar product by 

(3.14) 

Obviously, 5^ = 1 and S is symmetric with respect to both <.|.> and ( .|. )^2. 

As the operator (— -P) is symmetric and positive on (H, <.|.>), the operator {—SP) 
is symmetric and positive semi-definite on the Hilbert space {H, {■\-)]^2)- Using fur- 
thermore that the operator (—SP) has rank at most /, we can diagonalize this operator 
and choose orthonormal eigenvectors 4>i, . . . ,(j)f which span its image. Since {—PH) is 
positive semi-definite, the corresponding eigenvalues ki, . . . ,Kf are non-negative. Thus 
the operator (—PH) has the representation 

/ 

i-SP)ii;) = J2^i<t>i{<Pi\i^)L^ ■ 
1=1 

Multiplying by (— S) and using (13.14p together with the fact that 5^ = 1 and that 5 
symmetric with respect to {.\.) 1^2, we obtain a similar representation for P, 

f 

P,jj = -Y^KiS4>i<S^i\i;> . 
1=1 

Setting Tpi = ^Jki S4>i the result follows. □ 

We next rewrite the fermionic operator and the local correlation matrices in a com- 
pact form. Comparing (j3.12p with (j3.5p and (j3.4p . we find that 

/ 

P(x,y)i; = - ^ Vi(^) foralluGl/. (3.15) 

1=1 

Introducing the operators 

vr^ : F^C^ : v ^ (^'(/'/(a;)b^)i=i,...,/ 

/ 

ix ■■ : u h-^ -'^ui'ipi{x) , 

1=1 

the identities ()3.15p and (j3.1ip can be written in the simple form 



(3.16) 



P{X, y) = ix TTy 


V - 


> V 


Px — T^x ^x 


cf - 





(3.17) 
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The next lemma gives the comiection to the setting of Chapter [2j 

Lemma 3.5. (i) For every x G M , the local correlation matrix is Hermitian. 
It has at most n positive and at most n negative eigenvalues. 

(ii) In cases (Bl) or (B2) (on page the function F{x) = Fx satisfies the 
conditions (CI) and (C2) (on paae \18(l . respectively. 

(iii) Denoting the eigenvalues of the matrix F^Fy as in (j2.ip . the \^ , . . . , X^^ coin- 
cide precisely with the eigenvalues of the closed chain P{x,y) P{y,x), counted 
with algebraic multiplicities. 

Proof. It is obvious from the definition (|3.11|) that F^ is Hermitian. Also, the number 
of its positive and negative eigenvalues is bounded by the signature n of V. This 
proves (i). 

To prove (ii), we integrate ()3.11|) over x and use (13. 4|) to obtain 

[ iFx)ldfiix) = - <^j\ijk> . (3.18) 
Jm 

Taking the trace and using (I3.12p . we conclude that condition (Bl) implies (CI). 
If (B2) is satisfied, the rank of P equals /, and thus the vectors ipi, . . . ytpf in (I3.12|) 
are linearly independent. Taking the square of ()3.12p and using the idempotence of P, 
we obtain another representation of P, 

Pijj = ^'il^k<i^k\iPi><Hi^> ■ 

k,l 

Comparing these two representations in the basis ipi, . . . jipf of P{H), it follows that 

<ipj\ipk> = -Sjk ■ 

In view of ()3.18p . we conclude that the identity constraint (C2) holds. 

To prove (iii), we first recall that for quadratic matrices A and B, the products AB 
and BA have the same characteristic polynomials (see for example [6, Section 3]). If 
the matrices are not quadratic, i.e. A G Mat(C^,C''), B € Mat(C'^,C^) with p > q, we 
can extend the matrices by zero rows and columns to obtain quadratic p x p-matrices. 
Using the above result for quadratic matrices, it follows that 

detiAB - Xlcp) = AP~« det{BA - Al^) . (3.19) 

In words, the matrices AB and BA have the same eigenvalues and algebraic multiplic- 
ities, up to the zero eigenvalues whose multiplicity is obvious counting dimensions. 
We now write the product of the local correlation matrices according to (I3.17P as 

FxFy = AB where A = Lx and B = TTxiyiTy . 

Applying (I3.19p . we see that this matrix has (up to the obvious zero eigenvalues) the 
same eigenvalues and multiplicities as the matrix 

BA = TTxLy-Kyix = P{x, y) P{y, x) , 

where in the last step we used ()3.17p . This proves (iii). □ 

This lemma shows that considering the function F{x) = Fx, we are precisely in 
the setting of Chapter [2j In particular, Theorems 12.21 and 12.31 vield minimizing local 
correlation matrices. 
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3.3. Reconstruction of the Fermion System. The next lemma shows that the 
local correlation matrices can always be represented by a suitable fermionic operator P. 

Lemma 3.6. For any function F G L?'{M,3^ ,djjL) there is a kernel P E L'^{M x 
M,L{V)) such that for every x G M, the local correlation matrix (j3.17p coincides 
with F{x). The corresponding fermionic operator P satisfies the condition (A). If the 
conditions (CI) or (C2) (on paae W^) are satisfied, then the fermionic operator satisfies 
the conditions (Bl) or (B2) (on paae \33\) . respectively. 

Proof. For given x £ M we diagonalize the matrix F{x) by a unitary transformation U, 

F{x) = UDU-^ with U G U(C-^) and D = diag(i/i, ...,Uf). 

We order the eigenvalues such that (|2.6p holds and 1^2,1+1, . . . , z^/ = 0. Hence introduc- 
ing the matrices 

p = diag{y^\,...,^J\l^\) and s = diag(l, . , 1, -1, . , -1 , 0^^^^^), (3.20) 

n times n times f — 2n times 

we have the decomposition 

F{x) = -UpspU-^ . 
We now introduce the vectors '4'i{x), . . . , ipki^) £ V hy 

{^Pir = {pU-')f , a = l,...,2n, 

where in V we again work in the basis where the inner product has the form (j3.13p . 
Carrying out this construction for every x G M, we obtain functions 'ipi : M ^ 
V. Since F £ L?, its eigenvalues Vj are square integrable, and due to the square 
roots in (j3.20p and the unitarity of the transformation C7, the functions are in 
L^(M, V, dfj) C H. Hence ()3.12p defines a fermionic operator P with P{x., y) G L'^{M x 
M , L{y) , dn) . Furthermore, from the above construction it follows that 

(F,)f = -(V'fc(x),5V/(x))c2n = -^^p^{x),^i{x)^ , (3.21) 

and thus in view of (13. lip we see that the local correlation matrix F^ indeed coincides 
with the matrix F{x). 

Taking the trace of (|3.18p and using (j3.12p we obtain 



trP = - / Tr(F(x)) dfi{x) 
Jm 



and thus (CI) indeed implies (Bl). If (C2) holds, we obtain from ()3.18p that the 
vectors ipi,. . . ,ij)f are mutually orthogonal and normalized to <'0/|V'/> = ~1- Hence 
the fermionic operator (|3.12p is a projector on a negative definite subspace of H of 
dimension /. □ 

In view of this lemma. Theorem 13.11 and Theorem 13.21 follow immediately from 
Theorem 12.21 and Theorem 12.31 respectively. 

4. A Variational Principle in Infinite Space-Time Volume 

As explained at the beginning of Chapter [3l the variational principles so far were 
restricted in that the space-time volume and the number of particles had to be finite. 
We now introduce a class of variational principles in infinite space-time volume, which 
may involve an infinite number of particles. On the other hand, we need to specialize 
our setting by assuming that our system is homogeneous in the sense that the kernel of 
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the fermionic projector P{x, y) depends only on the difference vector y — x. This makes 
it necessary to assume that space-time has an underlying vector space structure, and 
thus this setting is of main interest in two situations: in the so-called continuous case 
we assume that (M, (.,.)) is Minkowski space and fi is the Lebesgue measure on M, 
whereas in the so-called discrete case we assume that (M, (.,.)) is a periodic lattice 
in Minkowski space (i.e. a discrete subgroup of W^) and fi is the counting measure. 
Generalizations of these two cases are clearly possible, but will not be considered here. 

The main simplification in the homogeneous setting is that P{x, y) can be written 
as a Fourier transform, i.e. in the usual physics notation 

where M is momentum space, which in the continuum case is isomorphic to Minkowski 
space, whereas in the discrete case M is a primitive cell of the reciprocal lattice. In 
order to obtain a nice measure-theoretic framework, we here combine the product 
P{p) d'^p into a Borel measure dv, taking values in L(V), where (V, -<-\->-) is again an 
indefinite inner product space of signature {n,n). In order to describe a completely 
filled Dirac sea in the vacuum, the measure dv would take the form (see for example [71 
Section 6]) 

di^ip) = {pj-f^ + m) d{{p,p) -m^) e(-p°) d^p , (4.1) 

where 7-^ are the Dirac matrices in Minkowski space, and ^ = endowed with the 
inner product ■^il)\(j)>- = if^cj), where ij^ = ^^7^^ is the adjoint spinor. This measure has 
the remarkable property that —v is positive in the sense that 

^v\{-u{VL))v'^ >^ forallt;Gy. (4.2) 

Unfortunately, measures of the form (j4.ip lead to ultraviolet problems. In order to 
avoid these problems, we shall assume that the measure dv is supported in a given 
bounded subset K C M (this is clearly no restriction in the discrete case, where 
already M is bounded). This motivates the following definition. 

Definition 4.1. Consider a regular Borel measure v on a hounded set K C M taking 
values in L(y) with the following properties: 

(i) For every v € V, the measure d-ivliyvy is a finite real measure. 

(ii) For every Borel set C K , the operator — z^(il) G L{V) is positive ()4.2p . 

Then v is called a negative definite measure on K with values in LiV). 

For a given negative definite measure we introduce the kernel of the fermionic 
operator by 

P{^)= f e*<P'^> di/(p) , where e^y-x. (4.3) 
Introducing the corresponding fermionic operator P by 

{Pip){y) = I Piy - x) ij{x) dfi{x) , 

J M 

this operator is well-defined for example on the test functions C^{M,V). We point 
out that this operator will in general have infinite rank, and thus (adopting the notion 
from Section [3.ip the total the number of particles is infinite. However, the quantity 

/loc := Tr(P(0)) (4.4) 
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is finite; it can be interpreted as the local particle density, which for our homogeneous 
system is constant in space-time. 

Following the procedure in Chapter [3l we introduce the closed chain by A{S,) = 
P(^)P(— ^) and define the spectral weight and the Lagrangian by (j3.7p and (j3.8p . 
respectively. However, the space-time integrals in p.9p and (|3.10p do not converge, 
because rewriting them as 



dfj,{x) d^{y) ■ ■ ■ = d^i{x) I d^i{i) ■ ■ ■ 
MxM JM JM 

and carrying out the ^-integral, our homogeneity assumption implies that the resulting 
function is constant, so that the x-integral necessarily diverges. This divergence is an 
artifact of working in infinite volume. Keeping in mind that in finite volume (obtained 
for example by compactifying space-time to a torus) the x-integral would give rise to 
an irrelevant constant, it is natural to simply drop the outer integral. This leads us to 
introduce the functionals S and T by 

s[u]= j c[A{i)]d^,{i) , T[u]= f i^(oP^iMe). (4.5) 

JM JM 

We are now ready to state the main result of this chapter. 

Theorem 4.2. Let {vk)k&n o, sequence of negative definite measures on the bounded 
set K C M satisfying one of the following two conditions: 

(I) Assume that (M, /i) is Minkowski space with the Lebesgue measure or a lat- 
tice with the counting measure. Assume furthermore that the functional T is 
bounded, 

T[uk] <C for allk£n. 

(II) Assume that (M, ^u) is a lattice with the counting measure. Assume furthermore 
that the functional S is bounded, 

< C forallk£N, 

and that the local particle density is bounded away from zero in the sense that 
there is a constant e > with 

Tr(P[z^fc](0)) > e for all k £ N . (4.6) 

Then there is a subsequence [i^ki) o-nd a series of unitary transformations Ui on V 
such that the measures Uiv^^JJ^^ converge in the {K)* -topology to a negative definite 
measure v with the properties 

T[i/] < liminfT[z^fc] , < liminf5[i^fc] . 

k k 

We note that, in contrast to the previous existence theorems, this theorem is stated as 
a compactness result. By applying it to a minimal sequence, one immediately obtains 
statements similar to those in Theorems 12.21 and 12.31 The advantage of the above 
compactness statement is that it applies immediately to situations where additional 
constraints are imposed (like further conditions on the support, the assumption of a 
vector-scalar structure, the condition of half-occupied surface states, etc). One only 
needs to ensure that these constraints are invariant under the unitary transforma- 
tions \Ji, and that they are continuous in the C''(-fir)*-topology. As a simple example, 
one may prescribe the local particle density (j4.4p . 
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Before coming to the proof, we illustrate the statement of the above theorem by a 
counter example, which shows why the support of the measures i^k must be uniformly 
bounded and why in case (I) one may not replace the functional T by the action S. 

Example 4.3. (The Dirac cylinder) We let (M, ( .|. )) be Minkowski space with n the 
Lebesgue measure. We choose n = 2 and {V, -<.\.y) as in the example (|4.ip . For given 
parameters r, L > 0, we consider in momentum space p = {uj,p) the measure 

du{p) = ^^L^M <5(|pf _ 1) ry;^/ + + ij d'^p . 

lovr L I J 

This measure is supported on the cylinder [—L,L] x 5^. Writing the square bracket 
as {l-^'jj + 1), the vector field / has Lorentz length one. Increasing r describes a Lorentz 
boost of this vector field. 

As is easily verified, is a negative definite measure. Computing its Fourier trans- 
form, we obtain 



P(^)= [ e'^P'^Uuip) 



1 sin Lt 



V-r2 + l7° + ir7V + l) / 6(\p\'^ - l) e'^P^d', 



ip 



Svr Lt 

1 sinLt / r-;; n \ siu If I 

4 Lt \ ^ ICI 

1 sinLt I / 7 n ^\ sinr ir£7 / sinr\ | 

= 4^|(-V^/ + l) ^ + ^(cosr---j|, 

where in the last line we set ?" = In particular, 

Tr(P(0)) = ^ Tr(-v/r2 + l70 + l) = 1 , 
so that the local particle density (|4.6p is fixed. A straightforward calculation yields 

and this diverges as r — > oo. The causal structure is more interesting because for 
large r, the timelike region shrinks to a small cylinder r < rmax- More precisely, a 
Taylor expansion near the origin yields for the two distinct eigenvalues X± of A{(^) 

_ ,_)^ = 16 (i + r^) (l - ^!(5±l!>) + 0(.-) . 

Hence the timelike region is given by 

3 f I 

r < rmax := - + ^ 
r \T-^ 

After verifying that in this region, the two eigenvalues A+ and A_ have the same sign, 
the action is computed to be 

. 37r2 1 , / I 



5 Lt ' \ 



+ . (4.8) 



Hence the action tends to zero as r -^■ oo. We conclude that Vk '■= t^\T=k is a minimizing 
sequence of the action S which does not converge. 
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It is also worth noting that, according to (j4.8p and ()4.7p . both functionals S and T 
become small if L tends to infinity. Hence dropping the condition suppz^^ C K, 
the sequence Uk '■= i'\L=k is a minimizing sequence which converges in the C^{M)*- 
topology to the trivial measure v = Q. 

A possible method for proving Theorem 14.21 would be to proceed as in Section 13.21 
by choosing generators of P{H) and considering the corresponding local correlation 
matrices. However, since the rank of P may be infinite, it now seems easier to analyze 
the kernel of the fermionic operator P{^) as an operator on the finite-dimensional 
indefinite inner product space {V, ~<.\.>-). This makes it possible to apply similar 
methods as in the previous chapters, which will be complemented by suitable estimates 
for negative definite measures. In V we again fix the pseudo-orthonormal basis (e^) in 
which the inner product ^.|.;^ has the representation (j3.13p . Furthermore, we denote 
the sup-norm of matrices in this basis by ||.|| and consider it as a norm on LiV). We 
say that a linear operator S on 1/ is positive if 

^u\Bu)~ > for all ti G y . 

Clearly, every positive operator is symmetric in V . However, in contrast to the situa- 
tion in scalar product spaces, not every positive operator is diagonalizable, as one sees 
in the simple two-dimensional example 

B = (_\ _\) , S=(^^ . 

But a positive operator can be diagonalized up to an arbitrarily small error term, as 
the next lemma shows. 

Lemma 4.4. Suppose that B is a positive linear operator on V . Then for every e > 
there is a unitary transformation U on {V,~<.\.y) such that 

UBU-^ = -diag(z.i, . . . , i/2n) + , 

where the real parameters Vi are ordered as in (12. 6p and 

\\AB\\ <£. 

Proof. Suppose that the characteristic polynomial of B has a root u ^ Q. Then there 
is a corresponding eigenvector n 7^ 0. Since is non-degenerate, there is a 

vector V with ~<v\u>- 7^ 0. Then 

-<v\Bu>- = V ~<v\u>- 7^ . 

Since B is positive, the bilinear form ~<.\B.>- is positive semi-definite. The correspond- 
ing Schwarz inequality (see [6l Lemma 4.1 (ii)] for details) 

^v\Bu>--'^ < -<u\Bu)~ ■-<v\Bv)~ (4.9) 

implies that 

< ■<u\Bu>- = v ^u\u>- . 

Hence u cannot be a neutral vector. Furthermore, the terms v and <u\u>~ have the 
same sign. Introducing the pseudo-normalized vector / = u/ ■<u\u>~ |, the orthogo- 
nal complement of / is an invariant subspace which does not contain u. Hence proceed- 
ing inductively, we can diagonalize B except on the invariant subspace corresponding 
to 1/ = 0. 
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Restricting attention to this remaining invariant subspace, the characteristic poly- 
nomial of B is trivial. Then B is nilpotent, and thus we can choose a basis where B 
is a direct sum of Jordan chains. Labeling the Jordan chains by an index c and the 
basis within each chain by f^^^ , ■ ■ ■ , Il^c) ' have 

Bfi'^ = and Bftl\ = f^'\ I G {1, . . . , L{c) - 1} . 

Let us verify that the inner products between the basis vectors have the properties 
that 

^ff^lfP^ = unless L(c) = L(c') and /' + / = L(c) + 1 . (4.10) 
To this end, we let c label the longest Jordan chain. Choosing an index I' < L{c'), the 
symmetry of the operator B yields that 

Hence the inner product -<fi^^\fi^ vanishes unless /' = L{c'). Next, from the calcu- 
lation 

^/i^^VS,^ = ^^"(^V^i)!/^,)^ = ^ftU\B''''ft'i>)^ = if L{<^) < L{c) 
and the fact that c is the longest Jordan chain, we conclude that 

^fi^fv'^"^ = unless /' = L(c') = L(c) =: L . 
Since ^.|.;^ is non-degenerate, we know that there is a Jordan chain c' such that the 
inner product '<fi^^\f^li)>~ is non-zero. If possible, we choose c' = c. Then all the 
inner products between the basis vectors of the chains c and c' can be computed as 
follows, 

and this vanishes unless I + 1' — L = 1. Hence ()4.10p is satisfied for the chains c and c'. 
Furthermore, one easily verifies that the subspace of V spanned by the vectors 
and ff is non-degenerate. Hence by going over to its orthogonal complement, we can 
proceed iteratively, proving (j4.10p . 

According to (j4.10p . all the inner products between the basis vectors remain un- 
changed if we rescale the basis vectors for any parameter p > according to 



But this transformation multiplies all matrix entries of i? by a factor p^^. Thus by 
choosing p sufficiently large, we can arrange that the matrix entries of B become 
arbitrarily small. We point out that the Jordan basis is not pseudo-orthonormal. But 
since the transformation matrix from the Jordan basis to the pseudo-orthonormal basis 
can be chosen independent of p, we can make the matrix entries of B arbitrarily small 
even in a pseudo-orthonormal basis. 

We have shown that there is a pseudo-orthonormal basis (fi) in which B is diagonal 
up to an arbitrarily small correction AB. Denoting the unitary transformation from 
the basis (/«) to our original pseudo-orthonormal basis (e^) by U (i.e. U{fi) = e^), the 
result follows. □ 

We next bound the Lagrangian at ^ = from below, using a method similar as in [6l 
Proposition 4.3]. 
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Lemma 4.5. For every negative definite measure /i, the corresponding Lagrangian 
(j4.5p satisfies at ^ = the inequality 

/:M(0)>^|P(0)pTr(P(0))^ 
where \.\ again denotes the spectral weight. 

Proof. According to Definition 14.11 (ii), the operator (— P(0)) = —i'{K) is positive. 
Thus applying Lemma 14.41 and taking the hmit e \ 0, we conclude that the spec- 
trum of P is real, and that we can order its eigenvalues Vi counted with algebraic 
multiplicities as in (|2.6p . The eigenvalues of A{Q) = P(0)^ are then obviously given 
by Aj = > 0. Hence rewriting the Lagrangian (13. Sp according to (|2.10p and choos- 
ing indices a,b £ {1, . . . , 2n} such that and — Wb\ are maximal, we obtain the 
estimates 

2n 2n 
i,j=l i,j=l 

2n n , 2n 



Tr(P(0))= £ |z.,|-^|^,| = i I; ^(|^,|_|^^|)<„(|^.J_|^.,|). 

A;=7i+1 1=1 k=n+l 1=1 

Combining these inequalities gives the result. □ 



Proof of Theorem \4-.S\ For ease in notation, we omit the brackets [v] and replace the 
arguments [v^] by an index, i.e. P^ = -P[i^fe], ^fc = ^[^^fc]; S = Slu], and so on. For 
any e > and A; € N, we apply Lemma 14.41 to the operator B = —V]^{K), which is 
positive according to (14. 2p . Replacing the measures by UukU^^, we conclude that 

MK) = diag(i^f \ . ..,4'"^) + Auk with IIAi^fcll < e (4.11) 

(where we again work in the pseudo-orthonormal basis (cj)). Our main task is to show 
that the measures Vk are uniformly bounded in the sense that there is a constant C > 
such that 

\Wk{^)\\ < C for all A: € N and all Borel sets ncK . (4.12) 

We first complete the proof of the theorem assuming this uniform bound, which we 
shall prove afterwards. If (j4.12p holds, the Banach-Alaoglu theorem and the Riesz 
representation theorem yield that a subsequence of the i^k converges in the C^{K)*- 
topology to a negative definite measure z^. Thus the integral (j4.3p converges pointwise, 

PkiO P{£.) for every ^ € M . 

As a consequence, the spectral weight |^fc(^)| converges pointwise. Using that the 
spectral weight is non-negative, we can apply Fatou's lemma to obtain 

T = [ hm |^fc(e)PA< lim / \Ak{0\^ d""^ = Urn Tk . 

Similarly, one sees that S < limfc_^oo Sk. This concludes the proof provided that (j4.12p 
holds. 
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To prove ()4.12p . we first make use of the positivity property (ii) in Definition 14. li 
Namely, this property ensures that for any Borel set ^1 C K, the bilinear form 
-<.|(— z^fc(r2)).;^ is positive semi-definite. Thus the Schwarz inequahty yields 

{^ei\{-Uk{n))ejy)'^ < ^ei\{-Uk{^))ei>~ ^ej\{-Uk{n))ejy . 

In other words, the off-diagonal diagonal matrix elements are bounded in terms of the 
diagonal entries, and thus it suffices to show that the diagonal entries -<ei\{—i'ki^))ei>- 
are bounded. Furthermore, in view of the fact that the measures -<ei\{—i'ki-))ei>' are 
all positive, it suffices to show that the total measure is bounded, i.e. 

I ■-<ei\i^kiK)eiy \ <C for all A; e N. 

Applying ()4.1ip and taking the limit e \ 0, this condition reduces to demanding that 
the spectral weight of Vk{K) be bounded, 

\vk{K)\<C for ah A: eN. (4.13) 

In case (II) when /u is a counting measure, it is obvious that £^(0) < S^- Thus 
Lemma 14.51 yields 

In view of (j4.6p and the uniform boundedness of the action, we thus have a uniform 
a-priori bound for the spectral weight of -Pfc(O). Using that Pa:(0) = Vk{K)i we ob- 
tain (ji38|l . 

In case (I) and if /i is the counting measure, the uniform boundedness of the func- 
tional T yields that the spectral weight 1^4^(0)1 is uniformly bounded. Since ^a:(0) = 
Pfc(O)^, it follows that the spectral weight of Pfc(O) is also uniformly bounded, prov- 
ing (14.13p . Thus it remains to consider case (I) for /i the Lebesgue measure. This is 
the most difficult case, because we need to bound the space-time integral (|4.5p from 
below by |P(0)|, making it necessary to estimate |^(C)I ™- a neighborhood of = 0. 
We introduce the quantity AP(^) = P(^) - P(0). Using (|43l) . we obtain 

d\\uk{p)\\<dmv^{k)U\\ \\uk\\{k), (4.14) 
where ||^|| is the Euclidean norm on M^, and the norm of the measure is defined by 

2n 

\H{k)=Y,\<e,\uejy\{k) 

(and \<ei\vej>-\ denotes the variation of a complex- valued measure, see [ini Sec- 
tions 28 and 29]). In order to estimate this norm, we first note that the Schwarz in- 
equality (14. 9|) allows us to estimate the off-diagonal elements | -<ei\vej>- \ {K) in terms 
of the diagonal elements. For the diagonal elements, on the other hand, we can use 
that — <ei\vei>- is a positive measure. Hence 

\\v{F)\\ <n\v{F)\ . 

Using this inequality in ()4.14p . we conclude that 

d\\yk{p)\\<ndmui{k)\\i\\ \vk{k)\ . 



|APfc(OII < 



|APfc(e)ii < 
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We choose ^ so small that 



U\\ < ^ ^ ■ (4.15) 

8n diam(i^) 



Then -Pfc(^) has the representation 

PkiO = Pk{0) + APkiO with ||APfc(e)|| < ^ |Pfe(0)| . 

Multiplying by a similar representation for using that, according to (j4.1ip . 
the matrix Pfc(O) is diagonal up to an arbitrarily small error term, we conclude that 

Akio = MO) + ^Mo with wAAkim < \ \Aum . 

Now AA can be treated as a perturbation. Using that the deviation of the eigenvalues 
is bounded by the sup-norm of the perturbation (see [ISi Chapter Two, §1]), we obtain 
the following estimate for the local trace, 

1^^(61 > \ \Am\ ■ 

Hence integrating over the ball M.lSh . we obtain the inequality 

Am? vr2 N-4 



Tfc > ^-^^ :- (^8ndiam(K) 

giving us the desired a-priori bound for the spectral weight |74fc(0)|. Again using 
that Afc(O) = Pfc(O)^, it follows that the spectral weight of Pfc(O) is uniformly bounded, 
proving (fiT3]) . □ 

Acknowledgments: I would like to thank Andreas Grotz, Joel Smoller, Daniela Schiefen- 
eder and the referee for helpful comments on the manuscript. 

References 

[1] J. Bognar, Indefinite inner product spaces, Springer- Verlag, New York, 1974, Ergebnisse der Ma- 

thematik und ihrer Grenzgebiete, Band 78. 
[2] A. Diethert, F. Finster, and D. Schiefeneder, Fermion systems in discrete space-time exemplifying 

the spontaneous generation of a causal structure. larXiv:07r6.4420i [math-ph]. Int. J. Mod. Phys. 

A 23 (2008), no. 27/28, 4579-4620. 
[3] F. Finster, The principle of the fermionic projector, hep-th/0001048| |hep-th/0 202059 | 

piep-th/0210121 , AMS/IP Studies in Advanced Mathematics, vol. 35, American Mathematical 

Society, Providence, Rl, 2006. 
[4] , The principle of the fermionic projector: an approach for quantum gravity?, 

[arXiv:gr-qc/0601128 , Quantum gravity (B. Fauser, J. Tolksdorf, and E. Zeidler, eds.), Birkhauser, 

Basel, 2006, pp. 263-281. 
[5] , Fermion systems in discrete space-time — outer symmetries and spontaneous symmetry 

breaking, arXiv:math-ph/0601039 , Adv. Theor. Math. Phys. 11 (2007), no. 1, 91-146. 
[6] , A variational principle in discrete space-time: existence of minimizers, 

|arXiv:math-ph/050306'9| Calc. Var. Partial Differential Equations 29 (2007), no. 4, 431- 

453. 

[7] , From discrete space-time to Minkowski space: basic mechanisms, methods and perspec- 
tives, arXiv:0712.0685 [math-ph], Quantum Field Theory (B. Fauser, J. Tolksdorf, and E. Zeidler, 
eds.), Birkhauser Verlag, 2009, pp. 235-259. 

[8] J. Glimm and A. Jaffe, Quantum physics, a functional integral point of view, second ed., Springer- 
Verlag, New York, 1987. 

[9] I. Gohberg, P. Lancaster, and L. Rodman, Indefinite linear algebra and applications, Birkhauser 
Verlag, Basel, 2005. 



CAUSAL VARIATIONAL PRINCIPLES ON MEASURE SPACES 



47 



[10] P.R. Halmos, Measure theory, Springer, New York, 1974. 

[11] S. Helgason, Groups and geometric analysis, Mathematical Surveys and Moiiograplis, vol. 83, 
American Mathematical Society, Providence, RI, 2000, Integral geometry, invariant diflferential 
operators, and spherical functions. Corrected reprint of the 1984 original. 

[12] R.A. Johnson, Atomic and nonatomic measures, Proc. Amor. Math. Soc. 25 (1970), 650 655. 

[13] T. Kato, Perturbation theory for linear operators. Classics in Mathematics, Springer- Verlag, 
Berlin, 1995, Reprint of the 1980 edition. 

[14] M. Reed and B. Simon, Methods of modem mathematical physics. I, functional analysis, second 
ed., Academic Press Inc., New York, 1980. 

[15] W. Rudin, Functional analysis, second ed.. International Series in Pure and Applied Mathematics, 
McGraw-Hill Inc., New York, 1991. 

[16] E.B. Saff and A.B.J. Kuijlaars, Distributing many points on a sphere, Math. Intelligencer 19 
(1997), no. 1, 5-11. 

[17] D. Serre, Matrices, Graduate Texts in Mathematics, vol. 216, Springer- Verlag, New York, 2002, 
Theory and applications. Translated from the 2001 French original. 

NWF I - Mathematik, Universitat Regensburg, D-93040 Regensburg, Germany 
E-mail address: Felix.Finster@mathematik.uiii-regeiisburg.de 



